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fn (54) title: compositions and methods for the therapy and diagnosis of colon cancer 

^ (57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, such as colon cancer, are disclosed. Composi- 
tions may comprise one or more colon tumor proteins, immunogenic portions thereof, or polynucleotides that encode such portions. 
Alternatively, a therapeutic composition may comprise an antigen presenting cell that expresses a colon tumor protein, or a T cell 
that is specific for cells expressing such a protein. Such compositions may be used, for example, for the prevention and treatment of 
diseases such as colon cancer. Diagnostic methods based on detecting a colon tumor protein, or mRNA encoding such a protein, in 
a sample are also provided. 
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TIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF 

COLON CANCER 



TECHNICAL FIELDOF THE INVENTION 

The present invention relates generally to therapy and diagnosis of 
cancer, such as colon cancer. The invention is more specifically related to polypeptides 
comprising at least a portion of a colon tumor protein, and to polynucleotides encoding 
such 5 polypeptides. Such 'polypeptides and polynucleotides may be used in vaccines and 
pharmaceutical" compositions 'for prevention and treatment of colon malignancies, and 
for the diagnosis and monitoring of such cancers. ' 



10 BACKGROUND OF THE INVENTION 

1 ; ! Cancer is a significant health' problem throughout the world. Although 

; Advances have been made in detection and therapy of cancer, no vaccine or other 
universally successful method for prevention or treatment is currently available. 
Current therapies,' which ate generally based on a "combination of chemotherapy or 

1 5 surgery and radiation, cWtihue io $6Wmia6qiuiteiii' many patients. 

'•I (Ti G3£ f : Colon caneef is the second most frequently diagnosed malignancy in the 
United States as well as the second most common cause of cancer death. The five-year 
survival rate for patients with colorectal cancer detected in an early localized stage is 
92%; unfortunately, only 37% 6f colorectal cancer is diagnosed at this stage. The 

20 survival rate drops to 64% if the cancer is allowed to spread to adjacent organs or 
lymph nodes, and to 7% in patients with distant metastases. 

The prognosis of coiori' cancer is directly related to the degree of 
penetration of the tumor through tlU bowel wall and the presence or absence of nodal 
involveineht, consequently early detection and treatment : are especially important. 

25 Currently, diagnosis is aided by the use of screening assays for fecal occult blood, 

-f. ' 

sigmoidoscopy, colonoscopy and double contrast barium enemas. Treatment regimens 
are determined by the type and' stage of the cancer, and include surgery, radiation 
therapy and/or chemotherapy! Recurrence following surgery (the most common form 
of therapy) is a major problem and is often the ultimate cause of death. 
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Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting,. cell that expresses a polypeptide as described 
above and(b) an immunostknurant. | 

The present invention further provides, in, pther aspects, fusion proteins 
5 that comprise at least one polypeptide aSi described above** as well as polynucleotides 
encoding, such fusion proteins, typically, in the, form,;pf pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable, farrier and/or an 
i immunostimulant. The [ fusions! proteins . may: comprise multiple ^immunogenic 
: ih polypeptides or poi^pi^vpiante.thfjreo^^dgscri^here^ and^ay r further comprise 
10, one or more polypeptide segments for facilitating the expression^ purification = and/or 

immunogenicity of the pplypeptiide(s). Of , '■. '';>•;! .:>>. :k.. ; j 

Within further aspects,... the present invention provides methods for 
stimulating an immune response in a patient, preferably a T t cell re sppnse in a human 
patient, comprising administering a pharmaceutical composition, described herein. The 
J 5 r patient may he afflicted with cplon cancer, in which case the methods provide treatment 
; for the disease, 'pr r patient . considered at risk , for such a disease may be treated 

prophylactically. .Kjirfao :>5c-.i:c ..^. ..t.- 

, Within further .aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
20 patient a pharmaceutical composition as recited above. The patient may be afflicted 
with colon cancer, in which case the methods provide treatment for the disease, or 
i - patient considered at risk for such a disease may be, treated prophylactically. 

; : The present invention further provides, y4thin omer aspects, methods for 

removing tumor cells from a biological sample, comprising exacting a biological 
25 sample with T cells that specifically react with a polypeptide r pf the present invention, 
wherein the step of contacting is ; peijforraed vmder,pondi^ons^and for a time sufficient to 
permit the removal of cells expressing, the protein from the sample,. > 

Within : related aspects, methods are provide^ -for . inhibiting the 
development of a cancer in a patiem> comprising administering to a patient a biological 
30 sample treated as described above. ; . , 
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Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide 'of the present invention, comprising 
contacting T cells with one or more' of: (i) a' polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide:;* 'and/or (iii) aii antigen presenting cell that 
5 expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansibri of T ceils. Isolated T cell populations comprising T cells 
prepared as described above are also provided! sr cn 

'Within further aspects; the present invention provides methods for 
inhibiting the development of a cancer ih a patient, comprising administering to a 
1 0 patient ah effective aniouht of a T cell population as described above. : 0 ' > 

Tlife r present 1 invention further i provides 'methods for inhibiting the 
development of a cancer iri a patient, Comprising thfe steps of: (a) incubating CD4 + 
and/or CD8 + T cells isolated from a patient with ohe or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
15 polynucleotide encoding sucli a 1 polypeptide; and (iii) an antigen-presenting cell that 
expressed such a' polypeptide; and (b) adhiinistei-irig to th^ jpatieht an effective amount 
' of the proliferated T cells, aM therefj^ rnruliitin^ %& deS/elopmeht of a cancer in the 
z " patient' j Proliferatec! cells may, but need not,' be cloiied prior to administration to the 
patient. * : 

20 Within further aspects, the present invention provides methods for 

determining the presence or absence of a cancer, 'preferably a colon cancer, in a patient 
comprising: (a) contacting £ 'biological sample obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an 
amount of polypeptide that binds to the binding agent; and (c) comparing the amount of 

25 polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer ih the patient. Within preferred embodiments, the binding agent 
is an antibody, more preferably a monoclonal ^tibody. 

The present invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient. Such methods comprise the steps 

30 -of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
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sample an amount of polypeptide tfrat binds, to the binding agent; (c) repeating steps (a) 
and (b) , using a. biological sample obtained from, the patient at a subsequent point in 
time; and. (d) comparing the amount of polypeptide detected, in step (c) with the amount 
detected in step, (b) and therefrom monitoring the progression of the cancer in the 
patient. ; 

The present invention furtlier provides, within other aspects, methods for 
determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sainple, e.g., rancor sample, serum .sample, etc., obtained from a 
patient with an .^igo^ucl^e , : ^ t hybridizes^ a ,1^1^0160^6 .that, .encodes a 
polypeptide of the present invention;-^) detecting in the sample a. level of a 
polynucleotide, preferably . m^A, , that hybridizes. „ to. the- oligonucleotide; and (c) 
comparing the level of polynucleotide that . hybridizes to the oligonucleotide with a 
predetermined cut-o/f r yalue, and therefrom determining ( the presence or absence of a 
cancer in the patient. Within certain embodiments, the amount of mRNA is detected 
via polymerase .chain reaction usjpg, for example, at least one oligonucleotide primer 
that hybrid^. q to a .|»^ucjeptid^^. encoding ,a. polypeptide as recited above, or a 
complement r of -such, a, fl^^HmlS^^F 1 ^ other .emtodjipcnfe, the ..amount of 
mRNA is,.deteqted using a hybridizatipn technique, emplpying an oligonucleotide probe 
that hybridizes to a polynucleotide that encodes a polypeptide as recited above, or a 
complement of such a polynucleotide. 

In related aspects, xnethods are provided for monitoring the progression 

V * t 

of a cancer in a patient, comprising the steps f o|: £a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of ,the present invention; (b) detecting in the sample an amount of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 
using a biological, sample obtained fr,om the patient at a subsequent point in time; and 
(d) comparing the amount of polynucleotide, -detected in step (c) with the amount 
detected in step {bYajid therefrom monitoring the progression of the cancer in the 
patient . . 



30 ■■ , Within f^er aspects, Ae present invention prpvides antibodies, such as 

; , .mpnoclqnal antibodies, tjiat bind to a polypeptide, as described .above, as well as 
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diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described' above are also provided. ' 

' These arid other aspects of the present invention will become apparent 
upon reference to the following' detailed description. All references disclosed herein are 
hereby incorporated by reference' in their entirety as if each was incorporated 
individually. 



BRIEF DESCRIPTION OF THESEQUENCE IDENTIFIERS 

\ ~ sEQrbN^ 

SEQ It) NO: 2 is me aetermiried cDNA sequence for 54 104.1 which 

10 • yiafes'hbmolo'^ wi<bfPAfc ! 75Nl3 : on chromblome Xq2l.l. 

.5 ^ \w SEQ ID NO: 3 s is'tfie determined cDNA sequence for 53978.1 which 
shares homology with Glutamihe:fructose-6 phosphate amidotransferase. 
! ; ' SEQ ID NO: 4 is the determined cDN A sequence for 54184.1 which 

shares homology with Colon Kruppel-like factor. ' 
l5 SEQ ID NO: 5 is the determined cDNA- sequence for 54149.1 which 

shares homology with cDNA 1 FtJlMf fik;'mh ! e^#f0et.482? 
. / i I i r ' SEQ ID NO: 6 is the determined cDNA sequence for 54034.1 . 

SEQ ID NO: 7 is the determined cDNA sequence for 54085.1 which 

i 

shares homology with Human beta 2 gene. 
20 ' SEQ ID NO: 8 is the determined cDNA sequence for 53948.1 which 

' shares homblogy with 12pl2 BAC RPC11 1-267J23. 

SEQ ID NO: 9 is the determined cDN A sequence for 54026.1 which 
shares homology with done 164F3 on chromosome X2q21 .33-23. 

SEQ ID NO: 10 is the determined cDNA sequence for 53907.1 which 
25 shares hornolbgy with Lysyl hydroxylase isoform 2. 

'■'■>■ ' ! SEQ ID NO: 11' is the determined cDNA sequence for 54066.1 which 

shafes homology with Mucin l'l . 

f r SEQ ID NO: 12 is the determined cDNA sequence for 54017.1 which 

shares homology with Mucin 1 1. ' ..!■-'■• 
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J Sm iD N ° : 13 ^ detenn^d cDNA sequence for 54006.1 which 
snares.homology with Mucin 11 

; - , ;. SEQ ID NO: 14 is ( the determined c P NA sequence for 53962.1 which 

shares. homology with Epiregulin (EGF family),,,., 

f SEQ ID ^o: .15 is the determined cDNA sequence for 54028.1 which 
shares homology with Mucin 12. ' ' " 

SEQ ID NO: 16 is the determined cDNA sequence for 54166 1 which 
shares homology witl^El^enWer biiadnig proton. ( , r .„ 

: , SEQ ip.NO:, 17 is the determined, cDNA sequence for 54174 1 which 

•shareshomology with.PAC cl<me RPl-lTpOl? from 7pl5-p21. 

SEQ ID NO; I § is the determined cDNA sequence for,53949. 1 . 
SEQ ID NO. V 19 i s the determined cDNA sequence for 53898.1. 
SEQ ID NO; 20 is the determined cDNA sequence fqr 54069.1 . 

• SEQ JM NQ: 2 1 is * e determincd cDNA sequence for 54048 1 which 
shares homology with cDNA .FLJ2Q676 fis, clone KA1A4294. 

: ,mJ$m.m^ determined cDNA sequence for 54031 1. which 
shares homology y^^mmmKb^m^^U^,, •■ , 

, SEQ ID NO: 23 is the determined cDNA sequence for 54154.1 which 
shares homology with Alpha topoisomerase truncated form. 

SEQ ID NO: 24 is the determined cDNA sequent for 54009 1 which 
shares homology with Cytokeratin 20. 

SEQ ID NO; 25 is the determined cDNA sequence for 54070 1 which 
shares homology with Erythroblastosis virus oncogene homolog 2. 

-SEQ 10^0: 26 is the determined cDNA, sequence for 53998.1 which 
shareshomologywith.Polyadenylatebindingprotein.il. 



30 



SEQ ID NO: 27 is the determined cDN A sequence for 54089,1. 
; SEQ ID NO: 28 is the determined cDWA sequence for 541 82 1 which 
shares homology with Transforming growth ^b^jn^ced. gene product. 

SEQ ID NO: 29 is the .determined cDNA sequence f or 53989 1 which 
shares homology with GDP-mannose 4,6 dehydratase, 

SEQ ID NO: 30 is the determined cDNA sequence for 5418L1. 
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> SEQ ID NO: 31 is the determined cDNA sequence for 54079.1 which 
s hareshomologywithPAC75Ni36nchromoso m eXq21.1. ' ^ 

SEQ ID NO: 32 is the determined cDN A sequence for 54114.1 which 
shares homology with Mus fork fead ^ tr^cription factor gene. : ' 

' SEQ ID NO: 33 is the determined cDNA sequence for 54160.1 which 

shares homology with Clone 146H21 oi chromosome^. ^ 

.. • ' SEQ ID NO: 34 Ts me daermined cDNA sequence for 54168.1 which 

shares homology wi m Glutamine^^ 

M - SEQ ID NO: 35 is the deterniined cDNA sequence for 54078.1 which 

shares homology with P AC 75N13 ^n chromosome Xq2 1.1. ~ _ 
- - * " - SEQ ID NO: 36 is the determined cDN A sequence for 53900.1 which 
shares homology with Internal ^ide-associated' transporter HPT-1 . 

SEQ ID NO: 37 is the determined cDNA sequence for 54147.1. 
' SEQ ID NO: 38 is the 'determined cDN A sequence for 54033.1 which 
' shares homology with Human proteose activated receptor-2: _ 1 

SEO ID NO: 39 is the determined cDNA sequence for 53908.1 which 

> shares .homology ^8mJ^n^ ^ f ° , '. 

SEQ ID NO: 40 is the determined cDNA sequence for 54022.1. 
SEQ ID NO: 41 is the determined cDNA sequence for 54039.1 which 



20 shares homology with Constitutive fragile sequence. 

SEQ ID NO: 42 is the determined cDNA sequence for 54037.1 which 

shares homology with CD24 signal transducer gene. 

SEQ ID NO: 43 is the determined cDNA sequence for 54129.1 which 

shares homology with Human c^yb gene. ' 
25 SEQ ID NO: 44 is the determined cDNA sequence for 54054.1 which 

' ' shares homology with Pyrrolme-t-carbbxylate synthase long form. 

SEQ ID NO: 45 is the determined cDN A sequence for 54055.1 which 

■■■■ * ^ shares homblogy wnh Human zinc'fihger^otein^-139. 

; SEQiDNO: ; 46isthedetermmedcD which 

30 Shares homology with Gene for ^ men^rane dof actor protem. ' ' 

' 1 1 ^ l - ■ • y : ; : " ■ , ■ 1 ' ■ ' : ...... 
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SEQ ID NO: 47 is the determined cDNA sequence for 54047.1 which 

shares homology with Colon Kruppel-like factor. ■ . . 

; ;ii , . SEQ ID >JO: 48 is the determined cDNA sequence for 54040.1 which 
shares homology with Human capping protein alpha, subunit isoform 1 . 
5 SEQ ID NO: 4,9 is the determined cDNA sequence for 54035.1. which 

shares homology with Ig lambda-chain. 

SEQ, ID NQ: 50 is the determined cDNA sequence for 54130.1 which 
shares homology with Protein tyrosine kinase. - , 

- SEQ JD NO:. $1 is the determined cDNA sequence for 54045.1 which 
10 shares homology with cDNA FLJ. 106 10 fis, clone NT2RP2005293, . 
, : ; ; V ; SEQ ip ( NO; 52 is the determined cDNA sequence for 54052.1 which 

shares homology :yyith Human miprotubule-assopiated protein 1, 
j > ■ SjBQ ID NO: 53 is the determined cDNA sequence for 54050.1 which 

( j -shares homology with Human retinoblastoma susceptibility protein. 
15 SEQ IP NO: 54 is the determined^cDNA sequence for 54051.1 which 

ric'i rlvv r : ' snares homology with Human reticulocalbin. 

SEQ ID NO: 55 is the r determined jCDNA sequence for 54178.1 which 

■jt — * ^ ' ... ■ ..... 

shares homology witl^ translation initiation factor elF3,p36 subunit. 
? u j SEQ ID NO: 56 is the determined cDNA sequence for 54148.1 which 

20 shares homology with Human ^purinic/apyrimidinic-endonuclease. 
t ; SEQ ID NQ: 57 is the determined cDNA sequence for 54058. L 

SEQ ID NO; 58 is the determined cDNjA, sequence for 54059.1 which 
I ; shares homology with Human integral transmembrane protein 1. 

SEQ ID NO: 59 is the determined cDNA sequence for 54126.1 which 
25 shares homology with Human serine kinase. 

SEQ ID NO: 60 is the determined cDNA sequence for 54127.1 which 
shares homology with Human CGI -44 protein. 

• 1 ... 1 * ; 

• '■ 

SEQ ID NO: 61 is the determined cDNA sequence for 54049.1 which 
: shares homology with fiADH/NADPH thyroid oxidase p 1 3 8-tox protein. 
30 SEQ ID NO: 62. is the determined cDNA sequence for 54056.1 which 

shares homology with Human peptide transporter (TAPl ) protein. 
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SEQ ID KfO: 63 is the determined cDNA sequence for 54064.1 which 
shares homology with Clone RP1-39G22 on chromosome Ip32.1-34B. 

SEQ ID- NO: 64 is the deterniihed cDNA sequence for 54124.1 which 
shares homology with Gloiie Tiiansforming growth factor-beta induced gene product. 
5 SEQ ID N(3? 65 is the determined cDNA Sequence for 54063 . 1 . 

1 SEQ ID NO: 66 is ttfe determined cDl<fA sequence for 54141.1 which 

shares homology with Cytokeratiri 8. f 

SEQ ID NO: 67 is the determined dDNA sequence for 54119.1 which 

' shares homology^iffi^ 
10 SEQ ID NO: 68 is the determined cDN A sequence for 54111.1 which 

L shares homology with Rumetanide-sensitive Na-K-Cl cotransporter. 

SEQ ID NO: 69 is the determined cDNA sequence for 54121.1 which 
shares homology withcDNA FLJ10969 fis, clone PLACE 1 000909. 
1 : ■ SEQ ID NO: 70 is the determined cDNA sequence for 54065.1 which 

15 shares homology with BAC clone 215012. - ^ - - : 

SEQ ID NO: 71 is the determined cDNA sfe^uence for 54060.1 which 
; • ' ; shaiis hbmolo'gy with 'A^utomitigfen -cflr&tibtilm: j va > r ' 

SEQ ID NO: 72 is the determined cDN A sequence for 54125.1 which 
1 shares homology with Human hepatic squalene synthetase. < 
20 SEQ ID NO: 73 is the determined cDNA sequence for 54143.1 which 

shares homology with tlUman RAD21 homolog. 

* 

x SEQ ID NO: 74' is the determined cDNA sequence for 54139.1 which 

shares homology with Human MHC class II HLA-DR>alpha. 
! SEQ ID NO: 75 is the determined cDNA sequence for 54137.1 which 

25 shares homology with Human Claudin-7. 

- SEQ lb NO: 76 is the determined cDNA sequence for 54044.1 which 

shares homology with Ribosonie protein S6 kinase 1. ; 

SEQ ID NO: 77 is the detehmhed ;; ct)NA sequence for 54042.1 which 

. . ■> ■» 

shares homology with CO-029 tumor associated antigen. 

50 ,; 11 SEQID NO: 78' is the determined cDN A- sequence for 54043.1' which 

shares homology with KIAA1077 p>rbteini 
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- SEQ ID NO: 79 is : the determined cDN A sequence for 54136.1 which 
shares homology with Human lipocortia II. , , . ,. 

I SEQ ID;NO: ,80 is the determined cDNA sequence for 54157.1 which 

, shares h<&jjology with PAC454G6 oji ch^pmpspme lq24. 

5 r SEQ ID NO: 81 is the d^termpe^GpNA sequ^ace for 54140.1. ; 

- r : ; L . t , SEQ ID NO: 82 is the determined cDNA sequence for 54 1 20. 1 . 

SEQ ID NO: 83 is the dete^ined qDNA sequence for, 54145.1 which 

< shares homology with fCIAAOl 52^; ; ^ \j ; t - ; 

SEQ ID NQ:,^4 u is the ; 4etermjned £PN^ se^uenqe. for. 541 17.1 which 

10 f sh^esliomqlpgyrwi^^^^ ; ) : r ; y oi 

SEQ IP Np> ( 85 is the determined cDNA. sequence for 54J 16.1 which 
shares homology wi$ UDP-N-^cety^ * 

SEQ ID NO: 86 is the determined, cPNA sequence fox$4l5lA . 
^ , SEQ ID NO: 87 i^fhe d^ermined cDNA sequence for 54152.1 which 

shares homology with Cystine/glutamate transporter. t , ^ ; , ^ , : : ; 
, ;; t §EQ iPiNO: 8$ is the : ^etf pained cDNA sequence for 54115. L 

SEQ ID NO: S&^^^te^^ sequent fQr, 54146.1 which 

sharesjiomology-wjth GAIfpH. ? 

SEQ ID NO: 90 is ^determined. cDNA sequence for 54155.1 which 
shares homology with cDNA DKFZp5&#)01 1 8. , 

SEQ ID NO: 91 is tfre determined cDNA sequence for 54159.1. 
SEQ IQ NO: 92 is the determine^ >cDN A sequence for 54020.1 which 
shares homology with Neutrophil lipocalin^ , . . r : < r v ; - 

> r SEQ ID NO: 93 is the detejrnined cDJSf A sequence for 54169.1 which 

shares homology with Nuclear matrix protein NRP/B : ; - . J •..-[ 

SEQ ID NO: 94 is the determined cDNA sequence for 54167.1 which 
shares homology with CG1-1 5 l i ^KJAA0!?92j>ro|eiri..^ < ;r . - - } ( 

SEQ ID NO: 95: is the determined cDNA sequence for 54030. 1 . 
SEQ ID.ltfQ: 96 is, the determined cDl^A^equpnce for, 54 161.1. 
a; ^ t SEQ >IJ> NO:, 97, is the determined qDNA sequence for 54162.1 which 
shares homology with Poly A binding protein;, , , , . , : . 
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SEQ ID NO: 98 is the determined cDNA sequence for 54163.1 which 
shares homology with Ribosomfe protein L13. :u - n ' 

SEQ ID NO: 99 is tfie determined cDNA sequence for 54164.1 which 
shares homology with Human alph4 enolase. E * ' 

• r SEQ ID NO: V00 is the determined cDNA sequence for 54132.1 which 

shares homology with Hufnan E^l eiizyme. T 1 i( 

SEQ ID NO: 101 J is the determined cDNA sequence for 54112.1 which 
sfesh^ 

SEQ ID N&: - tOZ is 'the •determined^ £DNA sequence for 54133.1 which 
10 shares homdlo^ 1 * 

SEQ ID NO: 103 is the determined 6DNA 1 sequence for 54165.1 which 
' ' ' Shares' homology with Bumetanide-sensitive Na-K-Cl cotrahsporter. 

SEQ ID NO: 104 is the determined cDNA sequence for 54158.1 which 
' shares homofogy with cDNA FLJ1 0549 fis; clone NT2RP2001976. 
15 • • ■ : SEQ ID NO: l'05 7 is the determined cDNA sequence for 54131.I which 

shares homology witii cDNA DlCFZp434C0523: - 1 ' - a • • * 

• ; SEQ ID NO: 106 is^detef^edcbW sequence for 54122.1. 
" i - ( ' K ' c ^ : ° } ; SEQ*© ftCi'lOT' isi&e d'etennSn^-cOHlA sequence for 54098.1. 

SEQ ID NO: 108 is me determined cDNA sequence for 54173.1 which 
20 shares homology with NADH-ubiquinone oxidoreductase ; NDUFS2 subunit. 

SEQ ID NO: 109 is the determined cDNA sequence for 54108.1 which 
shares homology with Phospholipase A2. i; 

t 

1 1 ' ' 3 ' SEQ ID NO: 1 1 0 ' is the determined cDNA sequence for 54 1 75. 1 which 
shares homology withcDNAFLJl0610 fis, clone NT2RP2005293. : 
25 ''' ! J SEQ ID NO; 111 is the determined cDNA sequence for 54179.1 which 
shares homology with Ig heavy chain variable region. 

' : SEQ ID NO: 1 12 is the determined cDNA Sequence for 54177.1 which 

? , r ' f 

shares homology with Protein phosphatase 2C 'gamma: 

' SEQ ID NO: 1 13 is the determined cDNA sequence for 54170.1 which 

30 r shares horiiology with Cyclin proteih. Vl 
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JSEQ ID NO: 1 14 is the determined cDNA sequence for 54176.1 which 
shares homology with Transgelin 2 (predicted). , : : ; y - , 

. SEQ ID NO: 1 1,5 is, the dpteprimed cDNA sequence for 54180.1 which 
shares homology with Human GalNAc-T^genejv j , , - . ; , ; 

5 SEQ ID NO: }}6 is the deterradned cDNA sequence for 53897.1 which 

shares homology with cDNA FLJ10884 fis, elope lvfTi2RP400l950. i{>{ \ e , 

SEQ ID NQ; 1 1 7. is t the detained cI>NA sequence for 54027. 1 . 
SEQ ID NO: 1.14$ isthe: d^te^nin^d cDNA sequenpe for 54133.1 which 
; ^shares homology vdfy /^pha to^^ 
1 0 SEQ ID NQ; cl 1 9 r is the ; ^etermiped cI?l^A secjuence fpr,541 07. 1 which 

, : shares homology with 1(^^1289. : J . - } \ >;4 (] \}T r \ 

: SEQ ID |sfO: .120 is the determined.^ which 
shares pomology with ^D022 proteiq. 0 1*11' : 

v.i..- SEQ^P tyO: 121 is the deter^ninedjcDNA sequence for, 53902,1. > 
15 : c ^QJD^Q: 122J$ the determined cf>NA sequence for 53918.1. which 

shares homology with Chromosome jl7 ? clppe hRJ:K:|6^2_E_l 8. 

, IC SEQ j^^}4^ f ^.the .dj^tprmiAed P^NA sequence for 53904. 1 . 
, -> SEQ 11^ NO: 124 is the determined ; eDNA sequence for 53910.1 which 

. . shares homology with cDNA FLJ 10823 fis, clone NT2RP4001080. 
20 *\ r SEQJD NO: } 25 As the determined cDNA sequence for 53903.1 which 

shares homology witih Vector. 

SEQ ID NO: 126 is the determined <?pNA sequence for, 54103.1. 
SEQ ID NO: J27 t is the determined cDNA sequence for 53917.1 which 
shares homology; >vith Cytochrome P450 J ; ^ t ~ 

25 SEQ ID NO; 128 is the determine^, cDN A sequence for 54004.1 which 

shares homology with CEA^ } /6 ^ --vv ' : , rr o; ^ 

SEQ IE) NO: 129 is the determined, cDNA sequence for 53913.1 which 
shares homology with Protein phosphatase ..(JS^PJ). ., . . .,, , ; . 

SEQ ID NO: 1 30,is the determined cDNA sequence for 54134.1 . 
30 SEQ ID NO: 131 is the determined cDl^A. sequence .for 53999.1 .which 

( ... . ~4 

shares homology with Alpha enolase. 
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SEQ ID NO: 132 is the determined cDNA sequence for 53938.1 which 
shares homology with Histone deacetylase HD 1 . 

SEQ ID NOV 133 is the determined cDNA sequence for 53939.1 which 
shares homology with citb_3 3 8_f_24, complete sequence. 
5 SEQ ID NO: 134 is "the determined cDNA sequence for 53928.1 which 

Shares homology with Huriian squalehe^epoxidase; 

ri 1 '» SEQ ID NO: 135 is ttie determined cDNA sequence for 53914.1 which 
shares homology with Human aspartyl-tRNA-^ynthetase alpha-2 subunit. 

; SEQ It) ^Ol 136 is the dfetermined cb^A sequence for 53915.1 which 
10 shares homology with Gamma^aetiri. : " r) °- ^ 

SEQ ID NO: 5 137 is the' determined : cDNA sequence for 54101.1 which 
shares 'homology with Human AP-rriu chain family member mu IB. 

SEQ ID NO: 138 is the determined cDNA sequence for 53922.1 which 
' shares homofogy with Hunian CctgrnRNA for chaperonin. 
15 1 ; ' SEQ ID NO: 139 is the determined cDNA' sequence for 54023.1 which 

shares homology with Chromosome 19. ; - ; ! " ; ; ' 

1 n 1 ' ■ SEQ ID NOV -MO' ^ which 
'sliares nomoldgy-with Hvimari ME<jF7. ' ■ ' 

' SEQ ID NO: 141 is'the determined cDNA sequence for 53921.1 which 

20 shares homology with Conriexin 26. 1 ! 

' SEQ ID WO: 142 is the determined cDNA sequence for 54002.1 which 

shares homology with Human dipeptidyl peptidase IV. 

*■ SEQ ID NO: 143 is the determined cDNA* sequence for 54003.1 which 
shares homology with Chromosome '5 clone CTC-436P 1 8. 
25 SEQ ID NO: 1'44 is the determined cDNA sequence for 54005.1 which 

shares homology wimtturrian 2-oxoglutarate dehydrogenase. 

SEQ ID NO: 145 is the determined cDNA sequence for 53925.1 which 
shares homology with RHO guanine nucleotide-exchahge factor. 

SEQ ID NO: 146 is the determined cDNA- sequence for 53927.1 which 
30 ' "shares homology with 12q24PAC RPC1 1-261 P5.' ' ' 
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: SEQ ID NO: 147 is the determined, gDNA sequence for 54083.1 which 
shares homology with Human colop. pjucopa-associated raRNA.,; 

r . .* SEQ-ID NO: 14$ is the determin,edf 9DNA sequence for 53937.1 . 

SEQ ID NO: 149; i$ the determined cDl^JA sequenpe for 54G74.1 which 
- -"w, v 5 . shares homology with Glone RP4-62 IE ^I»8 ? on chromosome lpl 1.4-21.3. 

SEQ ID NO: ; 15Q ^ is r the^eteirpined dQNA sequence.fpr 54105.1. 
y : SEQ ID NO: 151 r is th^ determined cDNA sequence for 53961.1 which 

shares homology with Hypianjembjtyo^cluiig protein. , ,~ , \ . .... .1 . , * 

, s -j.v ; : i- SEQ ID 1^0:, 152 js the^eten^ined el^N^ sequence for 53919.1 . 

10 SEQ ID NO: 153 is the determined ^c^^^^c^fo. ^9334^->vhich 

. . : , shares homology with Human leukocyte surface prpteiq jCD3 1 . 

; , SEQ ID NO: 15,4 is the determine^ cpNA-,sequence ^0.^559.72.1 which 
: , ; ' shares homology .with cDNA FIJI 0,679 fis, clone NT2R£2,0G6565 . 

, SEQ JED NO: 1 55 is the determined cDNA ; sequence for 53^06. 1 . 
■ -15 . SJEQ^ID^NO: 156 is . the ^^etennined cDl^A sequence for 53924. L which 

shares homology with Poly A binding/prot^ip. r i 
.u i: c SEQ ^Q^l^ i§ t^e^f te^mined gPNA ^sequence for 54144.1. 

SEQ ID NO: 158 is the determined cDI^A sequence for 54068.1 which 
' < f i shares homology with Cystic fibroses transmembrane conductance regulator. 
20 SEQ ID NO: 1 59 is the determined cDNA sequence for 53929. 1 . 

SEQ ID NO: 16Q is the detjennined pDNA sequence for 53959.1 which 
shares homology with KIAjA 1050. • ■ - f, 

SEQ ID NO: 161 is the determined cDNA sequence for 53942. 1 . 
SEQ ID NO: 162 is the; determined cDN A sequence for ; v 5,393 1.1 which 
25 shares homology with cDNA FLJ 1 11 27 fis, clone-PLACE 1 0JD6225 . 

SEQ ID NO: 163,,is me determined cDNA sequence for 53935.1 which 
shares homology .with Human set gene. 

SEQ ID NO: 1 64. is the. determined cDNA sequence for 54099.1 which 
: shares homology, with Human plecl<jstrin 2. 
30 SEQ ID NO: 165 is me.determined cDNA sequence for 53943.1 -which 

shares homology with KIAA0965. 
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SEQ ID NO: 1 66 is the determined cDNA sequence for 54000.1 which 

shares homology with Tis lid gene. 

SEQ ID NO: 1 67 is trie determined cDNA sequence for 54100.1 which 

shares homology with Cyhtokine (GRO-gamma). 
5 " SEQ ID NO: i68 is the determined cDNA sequence for 53940.1 which 

shares homology with Human p85McmmRN A. 
' V ""' 5 SEQ ID NO: 16^ is me determined cDNA sequence for 53941.1 which 

shares homology with cDNA DKFZp586H05l9: " ' 

' ' ; 5 SEQ ID NO: 1 ! 70 is the determined cDNA sequence for 53953.1 which 

10 shares homology witib SOX9. 

SEQ ID NO: \li is the determined cDNA sequence for 54007.1 which 
shares homology with' VAV-like protein. ' 

t- 

SEQ ID NO: 172 is the determined cDNA sequence for 53950.1 which 
shares homology with NFVE2 related factor 3. 
15 ' " " ' SEQ ID NO: 173 is the determined cDNA sequence for 53968.1 which 

shares homology with cDNA FLJ20127 fis/ clone COL061 76. 
: ; SEQmNdf'lWisme^ 

SEQ ID NO: 175 is the determined cDNA' sequence for 54091.1. 

SEQ ID NO: 176 is the determined cDNA sequence for 54013.1 which 

20 shares homology with Human argininosuccinate synthetase. - 

SEQ' ID NO: 177 is the determined cDNA sequence for 54092.1 which 

shares homology with Human serine kinase. 

' SEQ ID NO: i78 is the determined cDNA sequence for 54095.1 which 

shares homology with Clone RP1-1 55'G6 on chromosome 20. 1 
25 1 SEQ ID NO: 179 is the determined cDNA sequence for 53987.1 which 

shares homology with Human phospholipase C tfeta 4. 

' SEQ ID NO: 1 80 is the determined cDNA sequence for 53967.1. 

SEQ ID NO: 1 81 is the' determined cDNA sequence for 5 3 963 . 1 which 
shares homology with VAV-3 protein. 
30 5 5 SEQIE>NO:182isthedete^^ 
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.JJEQJD NO: 183 is the determined cDNA sequence for 54067.1 which 
shares homology with PAC RPCM 133G21 map21qll.l region D21 SI 90. 

. , t SEQ ID NO: 184 is^the determined cPN A sequence for 54057.1 which 
shares homology with Calcium-binding protein S l f 00P. 

SEQ ID NO: 185 is, the determined cDN A sequence for 54135.1 which 
shares homology with Human leupaxin. 

r SEQ ID NO: 1 86 is the^ determjned j cPNA sequence for 53969. 1 which 
shares homology with VAV-3 Protein. 
: ; y : ... .SEQ ID TsfO: / 1 87 is the determined fp^A sequence for 53970.1. 

SEQ ID NO: 188 is the determined cDNA sequence for 53966.1 which 

• -■ ' ■ ■ ' ■ ■ ■ V ; i ■ t i 1 1 '■. : 

shares homolpgy with hnRNP tj^e A/B protein. 

SEQ ID NO: 189 is the determined cDNA sequence for 53995.1 which 

* ! ■ 1 1 ( / r : v A:.. •;-"wv k i •. 

shares homology with Human cell cycle control gene QDC2, 

, SEQ ID NO: 1 90,is the jdetermine<J cDN A sequence for 54075. 1 . 

15 ; SEQ ID NO: 191 is me determine*} cDN,A sequence for 54094.1. . 

SEQ ID NO: 192 is the determined cDNA sequence for 53977.1. 

. s . fc : fD^TA sequence for 54123.1 which. 

, shares homology with B AC clone IJ.G083M05 from 7q2,l -7q22. 

■ ;i , SEQ ID NO: 194 is the determined cDNA sequence for 53960.1 which 

20 shares homology with Human STS WI- 14644. 

SEQ ID NO: 195 is the determined cDNA sequence for 53976.1 which 
shares homology with Human glutaminyl-tRNA synthetase. 

SEQ ID NO: 196 is the determined cDNA sequence for 54096.1 which 
shares homology with Human 26S proteasome-associated pad 1 homqlog. 
25 SEQ ID NO: .197 is the determined cDNA sequence for 541 10.1 which 

shares homology with Human squalene erjoxidase. , , 

SEQ ID NO: 1 98 is the determined cDNA sequence for 53920.1 which 
shares homology with.Human nuclear chloride ion channel protein. 

SEQ ID NO: 199 is the determined cDNA sequence for 53979.1 which 
30 , shares homology with PAC RPCI-l 133G21 m^p 21ql 1.1 region D21S190. 
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SEQ ID NO: 200 is the determined cDNA sequence for 5408 1.1 which 
shares homology with PAC clone RP5-1 1 8517 from 7ql 1 .23-q21 . 

■* - SEQ ID NO: 201 is therideteimined cDNA sequence for 54082.1 which 

shares homology with Human ephrin. 5 
5 l SEQ ID NO: 202 is the determined cDNA sequence for 53986.1 which 

shares homology with cDNA FLJ2G673 fis,clone KAIA4464 > 
- : _ \ . : . SEQ ID NO: 203 is the determined cDNA sequence for 53992.1. 

SEQ ID NO: 204 is the deterinined eDNA sequence for 54016.1. 
x ; ( i , ; i SEQ ID NO: 205 is the determined cDNA sequence for 54018.1 which 

10 shares ho&iGl6& ' l - 01 

5 e - - SEQ ID NO :^ 206 is the determined cDNA sequence for 53985.1 which 

shares homology with KIAA0715. ^ '< a;-' * ] 

. - SEQ ID NO: 207 is the determined cDNA sequence for 53973.1 which 

shares homology with Cyclin B. 
15 ^ > > : SEQ ID NO: 208 is the deteimined'cDNA sequence for 54012.1: which 

shares homology with KIAA1225, ^ * v • ; v ., x. *. .i; 

, rV- .-■).; ggQ ID NO: 209 

SEQ ID NO: 210 ibthe determined cDNA sequence for*53988.1 which 
shares homology with Colon mucosa-associated mRNA. 
20 SEQ ID NO: 2M is the determined cDNA sequence for 53990.1 which 

1 shares homology with cDNA FLJ20171 fis, clone GOL09761 . 

SEQ ID NO: 212 is the determined cDNA sequence for 53991.1. 
' r 1 SEQ ID NO: 213 is the determined cDNA sequence for 51519.1 which 

shares homology with CEA. 
25 SEQ ID NO: 21 4 is the determined cDNA sequence for 5 1 507. 1 which 

shares homology with Adenocarcinoma-associated antigen. * 
< f - * SEQ ID NO: 215 isthe determined cDNA sequence for 51435.1 which 

shares homology with Secreted protein XAGi ! 

1 l - " ! SEQ ID NO: 216 is the determined cDNA sequence for 51425.1 which 
30 shares homology with Aderiocarcindma-assdciated antigen. f - 

SEQ ID NO: 217 is the determined cDNA sequence for 51548.1. 
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SEQ ID NO: 218 is the determined cDNA sequence for 51430.1 which 
shares homology with CEA. : c . • u. : ; 

•' SE Q ED NO: 219 is ihe determined £DNA sequence for 51549.1 which 
shares homology with CEA. . r s . . , . r 

SEQ ID NO: 220 is the; determined cDNA sequence for 51439.1 which 
shares homology with Nonspecific crossreactingantigen;. . , .. , .. 

SEQ ID NO: 221 is the.detejmined cDNA sequence for 5 1 535 . 1 which 
i shares homology with Neutrophil igelatinase ^sociated lipocajin. 

SEQ IDJNO: 222;is the idetenriined iCDNA . sequence for 5 1486 J which 
1 0 shares homology with Transformation growth; factor-beta induced gene product. 
i ^ SEQ AD NO: 223 is ^ 

shares homology with Undetermined origin-found 5 ? tp NQA mRN^ ~ * 

SEQ ID ND: 224 is the determined cDNA; sequence for 51469.1 which 
shares homojogy with Galectin-4. 

1 5 ' 1 • SEQ IB NO : <225 is the i determined cDNA^s&quence for 5 1 470. 1 which . „ 

shares homology with Nonspecific crossreactijig antigen.. 

:/ " & B Q^>^^^ sequence for 51536.1 which 

shares homology with Secreted protein XMi; o-\ ' 

SEQ ID NO: 227:is the determined cDNA sequence for ,5 1483.1 which 
20 shares homology with Clone 146H21 on chrpmosome^Xq22. 

SEQ ID. NO: 228 is the determined cDNA sequence for 51522.1 which 
shares homology with GAPDH. t •>../.; . . 

; SEQ ID NO: 229 is the determined cDNA sequence for 51485.1 which 
shares homology with Mucin 1 1 . . , 

25 ' SEQ ID NO: 230 is the determined cDNA sequence for 51460.1 which 

shares homology with Nonspecific crossreacting,antigeq.,w , 7: . ..i 

. > SEQ ID NO:/ 231.is.the determined cDN£ sequence for 51458.1 which 
shares homology with KIAA05 17. protein. 

. : 1 SEQ ID NO; 232 . j^the determined cDNA sequence for 51506.1 which 

30 shares homology with Surface glycoprotein CD44. 
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SEQ ID NO: 233 is the determined cDNA sequence for 51440.1 which 
shares homology withiChromosome 21q22.1, D21S226-AML region. 

SEQ ID NO:;234 is the determined cDNA sequence for C907P. 
SEQ ID NO: 235 is the amino acid sequence for C907P; 
5 ; ; . A SEQ ID NO: 236 is the determine cDNA sequence for Ral2-C915P-f3. 

SEQ ID NO: 237 is the amino acid sequence for Ral 2-C9 1 5P-f3 . 
m ■* SEQ IDNO: 238 is the nucleotide sequence oMie AW 154 primer. 
SEQ ID NO: 239 is the nucleotide sequence of the AW155 primer. 
SEQ ID NO: 240 is the nucleotide sequence of the AW1 56 primer. 
10 SEQ ID NO: 241 is the nucleotide sequence of the AW1 57 primer. 

SEQ ID NO: 242 is the nucleotide^sequence of the AW1 58 primer. 
^ _ ; . , ; , SEQ ID NOt 243 is themucleotide sequence of the AW1 59 primer. 

\ . [ SEQ ID NO: 244 is the determined full-length cDNA sequence of 



'v„: C915P. •' '" ' - vv • 

15 , , (SEQ ID NO: 245 is the amino aeid sequence, encoded by the cDNA 

r ; sequence set forth in SEQ 3D NO:244.i 

I^^MLEDDESCMON OFTHEINVENTION - ■ • 

i . , The present invention isrdirected generally to compositions and their use 

20 in the therapy and diagnosis of cancer, particularly colon cancer. As described further 

below, illustrative compositions of the present invention include, but are not restricted 
..•> to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding 

such polypeptides, antibodies arid other binding agents, antigen presenting cells (APCs) 
i i . and immune system cells (e.g., T cells). 

25 ' the "'practice ''of the present invention will employ, unless indicated 

Specifically ^ of virology, immunology, 

! ! ^m6i^imo\^ia''hi^ arid Vecornbinant DNA techniques within the skill of 
1 > r the artV many of winch are ^described below for the purpose of illustration. Such 
: : fechriiques ^'"expl^'-fuUy'm^e literature. ,; See, e.g., Sambrook, et al. Molecular 
30 Cloning: A Laboratory Manual (2nd Edition, 1 989); Mariiatis et al: Molecular Cloning: 
A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 
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Glover,: ed.); 'Oligonucleotide Synthesis J (N. Gait, ed., 1984); Nucleic Acid 
Hybridization XB. Hames & S. Higgins, eds., 1985); r Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal CelKGulture (RL>Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (J 984). - : ^ 

All publications, patents knd <praSent applications cited herein, Whether 
supra or infra, are hereby incorporated by reference in their entirety. 

LAs used in this specification arid the &p$ended tlaims, the singular forms 
"a," "an" and "the" include plural referenbes unless 'the content clearly dictates 



otherwise. 
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10 Polypeptide Compositions ; « .,rb ; :,K *Ov: *.T C v 

r As used herein, the ;term "polypeptide 11 " i&used in its conventional 

meaning, i.e., as a sequence of amino acids'*. Th6 polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
\ ^ within the n definition of polypeptide, Hand guch terms maye'be used interchangeably 
15 herein unless specifically indicated otherwise! This term! also does not refer to or 
exclude post-expression modifications of the polypeptide, for example, glycosylations, 
acetylations, phosphorylations afid the KkeJasCwbllas otheii modifications Known in the 
t art, both naturally, occurring fend/non-naturally occurring. A polypeptide may be an 
- entire protein;, ^or a subsequence thereof. Particular; polypeptides of interest in the 
20 context of this invention -are amino acid subsequences comprising epitopes, i.e., 
antigenic determinants substantially responsible for the. immunogenic properties of a 
polypeptide; and being capable of evoking; an immune response.. : . q\> / 

Particularly illustrative polypeptides of the present invention comprise 
those encoded by a polynucleotide sequence se^t forth in any one of SEQ ID NOs; 1-234, 
236, and 244, or a sequence that hybridizes under, moderately stringent conditions, or, 
alternatively, undej highly stringent conditions, Jo a polynucleotide, sequence set forth 
in any one of SEQ ID NOs: 1-234, 236, and 244. Certain other illustrative polypeptides 
of the invention, comprise ^mino f acid sequences , as set forth . in any one of SEQ ID 
NOs:235, 237, and 245. 
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The polypeptides of the present invention are sometimes herein referred 
to as colon tumor proteins or colon tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 
colon tumor samples. Thus, a "colon tumor polypeptide" or "colon tumor protein," 

r 

5 refers generally to a polypeptide sequence of the present invention, or a polynucleotide 
' - sequence encoding subh ^polypeptide, that is expressed in a substantial proportion of 
' "colon tumor samples, Tor example preferably greater than about 20%, more preferably 
' greater than about 30%!, and riidst prefeirably greater than about 50% or mbre of colon 
tumor samples 'tested- ak a level that is at least two fold, and preferably at least five fold, 
10' greater thaiv the level 1 of expression in noitnar tissues, as determined tiding a 
; i representative assay prov^ polypeptide sequence of the 

5 invention, ba^ed 1 upon its increased level of expression in tumor cells, has particular 
- lL utility both as a diagnostic marker as well as a therapeutic target; as further described 
; 'below. 1 " ' ; * " '■ * y ' ' 

15 ; r j ' ' ' In certain preferred emboHimerits, the polypeptides of the invention are 

immunogenic, i.e., they react detectably within an immunoassay (such 'as an ELISA or 
T-cell stimulation assay) witii zs&M&a^or T-cBlls^cSri £ patient with colon cancer. 
J - r -Screerfing for immunogenic activity can be performed using techniques well known to 
T the skilled* krtisan. For example!, such screens can be performed using methods such as 
20 those described in Harlow and Lane, Antibodies:' A Laboratory Manual, Cold Spring 

L. 

Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted' with patient 'sera to allow binding of 
antibodieis within the sera tb the immobilized polypeptide. Unbound sera may then be 
Amoved and bound antibodies detected using, for example, I-labeled Protein A. 

25 J 1 As would be recognized by the skilled artisan, immunogenic portions of 

the polypeptides disclosed herein are also encompassed by the present invention. An 

1 r ' *imiriimogenic portion," as used herein, 1 is a fragment of an immunogenic polypeptide 
1 of die invention that itself is immunologically reactive (/. e. , specifically binds) with the 
1 B-cells and/or T-cell surface antigen receptbrs that recognize the polypeptide. 

30 Immunogenic portions %ay 'generally be 'identified using' wtell known techhiques, such 
as those Suriimarized in iPiui; Fundamental Immunology, 3rd ed., '243-247 (Raven Press, 
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1993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen r specific ( antibodies,, antisera and/or T-cell lines or 
: clones,. As used herein, antisera , and antibodies are "antigen-specific" if they 
specifically bind to an antigen (/.*,, they react wi^h the protein in an ELISA or other 
immunoassay, and do not react ^etectably, with plated proteins). Such, antisera and 
antibodies may, be prepared as described herein ? ,and using well-Jcnown techniques. 

In one preferred ^mbo^ment, an,. immunogenic portion-of a polypeptide 
of the present invention, is,a,jprtion^ reacts ^th^^^d/pj T r cells at a level 
that is not substantially,^ thanjl* react^y oft^e f^^m^P^m^(e.g., in an 
LEUSA and/or T-ceU reacti^ act - yity of 

< ^ immynQgenic m^nlspX least abo H t, 50%, preferably aUeast abwt 70% and most 
, preferably .greater, than about 9Q% ;( of the immunogenic^ ^ "the |. full-length 
' Pplypeptide - to «^>e ^stances, preferred immunogenjc^rtion? v^ ( b^;dentified that 
have a level ,of immunogenic activity greater than that of the corresponding .full-length 
n, polypeptide,,^.^., haying, grater %n about, 100% or, 150% or more immunogenic 
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1 t \ 

,; :>cto. mm$l^~m*$^?m **ratjv$ immunpgenic portions may 

sequence and/or transmembrane domain 
have been deleted. Other illustrative immunogenic portions will ; lcprrtain ' alnall N- 
and/or ^-terminal deletion (e.g,, 1-30 amino acid^ preferably 5-15 amino acids), 
relative to the mature protein, 

In another embodiment, a polypeptide composition of the invention may 
also comprise one or more polypeptides, that are immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, oarticularly a 
polypeptide having an amino acid sequence, disclose^ herein, or to an immunogenic 
fragment or variant thereof. 

r - In .embpdimen.t of the, invention, polypeptides . are provided that 

comprise one ; or more polypeptides,^ are capable of^diciti^ ^ cells and/or 
antibodies that are immunologically reactive ^jth pne ; or more polypeptides described 
.herein, j0 r one or more polypeptides encoded by contiguous ^nucleic ; acid. | sequences 
contained in me polynucleotide , W 
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or variants thereof, or to one'or more nucleic acid sequences which hybridize to one or 
more of these sequences under conditions of moderate to high stringency. 

' The present invention, in ahothertspect, provides polypeptide fragments 
comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, 
> deluding all intermediate lengths, of a polypeptide compositions set forth herein, such 
as 'those set forth in SEQ ID NOs:235, 237, and 245, or those encoded by a 
*" polynucleotide sequence set forth in a sequence of SEQ ID NOs:l-234, 236, and 244. 
r ? ' In another aspect, the present inventibn provides variants of the 

polypeptide 1 compositions described" herein. ' Polypeptide variants generally 
encompassed by the present invention will typify exhibit at least' about 70%, 75%, 
«, 85%, 90%; 91%T92W93%: 94%, 95%r96% 97%, 98%, of 99% or more identity 
(deterrhmid as described below), along Its length; to a polypeptide sequences set forth 
herein. 

' In one preferred embodiment, the polypeptide fragments and variants 
provided by the present invention are immunologically reactive with an antibody and/or 
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15 



T-cell that reacts with a full-length polypeptide specifically set forth herein: 



, .1 



In another j^^'^l^^'^U^^ fragments and variants 
5 provided by the present invention exhibit" a'level of immunogenic activity of at least 
about 50%, preferably at least about 70%; and most preferably at least about 90% or 
more of that exhibited by a mlWengm p'olypeptide sequence specifically set forth 

, ■ -it- -J . . 1 . 

herein. 

A polypeptide "'Variant," as the term is used herein, is a polypeptide that 
typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 
the above polypeptide sequences of the invention and evaluating their immunogenic 
activity as described herein and/or using any of a number of techniques well known in 
the art. 

For example, certain illustrative variants of the polypeptides of the 
30 invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
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include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino 
acids) has been removed from the N- and/or C-terminal of the mature protein 

In many instances, a variant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
5 amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect {he secondary structure and hydropathic nature of the 

■ r ■ >- ■ ■ - .- - ' . ) 

. polypeptide to be substantially unchanged. As described above, modifications may be 
> ; v m ^ e ¥ s £ ucture of the polynucleotides and polypeptides of the present invention 

, 311(1 still obtain a functional molecule that encodes a variant or derivative polypeptide 

c*%" r™**i desirable characteristics, e,^. ? with immunogenic characteristics. When it is 
. ; desired ^o alter the amino acid sequence, of a polypeptide to create an equivalent, or 

even an improved, immunogenic variant or portion of a polypeptide of the invention 
one skilled in the art will typically change one or more of the codons of the encoding 
DNA sequence according to Table 1 . 

For example, certain amino acids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
^.structures such ,as ? for ex^jDle^ ^tigen ; bindinjg regions of antibodies or binding sites 
on substrate molecules. Since , it is, the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 
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Table 1 



Amino Acids 


* ■ ■ , ■ • ' < ; 


^ • t i i 










Alanine 


Ala 


A ' 1 


GCA 


GCC 


GCG 






'Cysteine : 


Cys 


C .-. 


'a UGG . 


/UGU 




.... ,i 
:s . . } J ' 




Aspartic acid 


Asp 


D 


GAC 


ATT 

GAU 








Glutamic acid 


Glu 


E . 


GAA y 


GAG 








Phenylalanine 


Phe .,, 


F 


uuc 


V ■ If W V f 

uuu 

i * 






t ; 


Glycine 4 


Gly 


Q 

[ ir 


GGA . 


GGC 


GGG 


GGU 


i r ^ 


Histidine 


His 

»-,-. v 


H 


CAC 

-J ., j / i. ! 1 * 


CAU 

- i ■» 


■'i ... J"- 






Isoleucine , 


He 


I 


AUA . 


AUC 


AUU 

.- " T 




r' : 


Lysine 


Lys 


K 


AAA 


AAG 






■* • 


Leucine 


Leu 

* - 1 


L 


UUA 


UUG 


CUA 


cue 


CUG 


Methionine 


Met 




AUG 


* i i 








Asparagine 


Asn 


■J 

N 


AAC 


AAU 








Proline 


Pro 


P 


CCA 


^■■■■va rf 1 ™"* 

CCC 


CCG 


ecu 




r Glutamine ' 


Gin 


Q ' 


GAA 


GAG 






* 


Arginine 


Arg 


R 


AGA 


AGG 


GGA , 


. €GC 


CGG 


.*'■ Serine / \; 


^Ser , 


4 S 


•••:AGC . 


B AGU [ 


...sLiMGA 


UCC 


UCG 


^ThJieoriine , 


Thr 


. ,T 


.ACA 


m AGC, 


ACG. 


ACU 




Valine 


Val 


v , 


GUA 


; <3VC t: 


..GUG 

■ * :■ * . 


, GUU 




Tryptophan 


Trp 


w .. 


UGG 










Tyrosine 


Tyr 


Y. 

t 


UAC 


UAU 

i 


e 







CUU 



CGU 
UCU 
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In, making such changes, the hydropathic index of amino acids may be 
considered. The importance of the , hydropathic amino acid index in conferring 
interactive biologic function on a protein is, generally understood in the art (Kyte and 
Dpolittle, 1?82, ^incorporated herein by reference). , It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
re§ultaat .piojein, . which, >. .turn . defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on , the basis of its 
hydropbpbicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 



wo F Ji/«&590 

* 
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IJZfU 



isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (- 
0.8); tryptophan (-0,9); tyrosine (-1.3); proline (-1.6); histidine (-3,2); glutamate (- 
3.5); glutamine (-3.5); aspartate:(-3.5)^asparagine (-3.5); lysine (-3.9); and arginine (- 

It is known iff the art that certain amino' acids may be substituted by 
other amino acids having a similar hydropathic index or score and still result in a 
protein with similar biological activity, i.e. still obtain a biological functionally 
equivalent protein. In making such changes, the substitution of amino acids whose 
hydropathic indices are within ±2 is preferred, those within ±i are particularly 
preferred, r and those within ±0.5 are even more particularly preferred. It is also 
understood in the art that the substitution, of like amino^acids can be made effectively on 
the basis of hydrophilicity. , U. S. Patent J,554,l 01 (specifically incorporated herein by 
reference , in, its entirety), states that the greatest local average hydrophilicity of a 
1 5 protein, as governed by the hydrophilicity; of its adjacent amino acids, correlates with a 
biological property <Jf the protein. j-.i 

J As >aetSiled ifnJ/U. S. latent 4,554,101,? the following > hydrophilicity 
values have been 'assigned to 11 amino acid residues: arginine (+3.0); 'lysine (+3.0); 
aspartate (-te.Oi 1); glutamate- {+3.0 + 1); serine (+0.3); asparagine (+0.2); glutamine 
20 (+0.2); glycine (0); threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine (-0.5); 
cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); 
tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino 
acid can be substituted for another having a similar hydrophilicity value and still obtain 
a biologically equivalent, and in particular^ 'an immunologically equivalent protein. In 
25 such changes, the substitution of amino acids whose hydrophilicity values are within ±2 
is preferred, those within ±1 are particularly preferred, and those within +6.5 are even 

j * : , ... -\ ........ 

more particularly preferred. ' '' " r ' : : - u - lr: ; 

V * 

** *• . » 

As outlined above, amino acid'substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain i-^stimenis;f6r^ example^ their 
30 hyarophobicityVTiydrophilicity, ctiarge/sike, arid the HkeV fixemplary substitutions that 
take various of the foregoing characteristics into consideration are Well known to those 
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' of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutarnine and asparagine; and valine, leucine and isoleucirie. 

In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
5 flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl 
{ : - rather man phosphodiesterase linkages in the backbone; and/or the inclusion of 
hbhtraditibnal bases such ai inosine, queosihe' and wybutosihe, as well as acelyl- 
r rtitQ^, m6- '^ other modified forms of "adenine; cytidine, guanine, thymine and 

uridine. 

10 r • C" Amino acid substitutions Way ' fuMef be made on the basis of similarity 
A - pblaiify,'' cMge; ' solubllf^rhydro^hbHoty; hydrojMicity ; and/6r -toin^pathic 
r " ' * tiature' u of 'theresidu^! For example',; negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
; ' Uino acids*^^ similar hydrophilicity values 

15 include leucine; isoleucine and 'valine; glycine and alanine; asparagine 4 and glutarnine; 
r : and seririe, threonine, phenylalanine' am^tyrb^^'<5ther'gr6t^of amino acids that may 
v; -•••'^p^ht^nser^ativc c^^s^W^ 1 ^^^ N>, gin, asn, ser, thr; 

'*-g)l$£k£ tyr; thr; (3) val, ile, leu, met/ala, pKe; (^^ys,' arg, his; and (5)phe, tyr, trp, 
his. [ A/ variant iriay also, or alternatively; contain noncbnservative changes. In a 
20 preferred embodiment, variant polypeptides' differ from a< native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 
alternatively) be mddified by, for example, Hie deletion or addition of amino acids that 
have minimal influence' on the immunogenicity, secondary structure and hydropathic 
r nature of the'polypeptide. ~. ; ■ • 

25- - " >' " As noted- above, polypeptides may comprise a signal (or leader) 
V n sequence" aV the N^erminal [ end of the protein, which co-translationally or post- 
' ' translatibnally ^ polypeptide may also be conjugated 

to a linker- m other sequence for Wse of synthesis, purification or identification of the 
: --pWlyp^^ae (e:g'; Wy^s); Vto e^ce : binding of the poiy^eptide to a solid support. 
30 For exarnpie', a polypeptide may be conjugated to an inimuno^ 
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« ^ hen com Panng polypeptide sequences, |wo sequences are said to be 
"identical',-, if ; the. sequence of amino, acids, in the two sequences is the same when 
aligned for maximum correspondence,, as described below. Comparisons between two 
sequences are typically performed -by ; comparing, ^the sequences oyer a comparison 
; , 5 window to identify and compare local regions of sequence,snnilarity m a' "comparison 
window " * V** herein, .refeiis to a ?f gm ? pt,of at Jeast ,aJ)out 20 n co^igupus positions, 
usually,30 , t o about 75, 40 to about 50, in;, which a se^e ^^c^nd to a 
: :, ; reference sequence of the same number >of contiguous position after the.two sequences 
are optimally aligned. 

, . .i . . 

1 A ' ' '"*"'" J 

iWtic „ -C^mal ^fSmm^fm^^^m^n may be conducted.-using 

■ * e ¥ e gau'gn program in.,the jLasergene suite o^bioin^rmatics, software (DNASTAR, 
v : Inc., Madison, WI), using default ; pajameters. This . program, embpdies several 
.alignment schemes described ir^lhe following references: . , D ; ayhoff, M,Q, (1978) A 
v of evplutjonary change ^ proteins^ Matrices for detecting distant relationships. 
J5 tJi ,In Dayhoff, M.Q :i (ed.;> Atlas of Protein. Sequence ^Structure, Nat^nal Biomedical 
;s , !f£ rResearch Fo^atip^ ,^ashjr^ DC,VpL 5, .Suppl. 3, pp. 343-358;, Hein J. (1990) 
. Unified Approach tp A^gr^^jand^hy^en^s m 626-645 Metyds.in Enzymology 
183, Academic press, Inc r San Diego„CA ? Wggins,.D.G. and.^h^p,^ (1989) 
CABIOS 5:151-153.; Myer^W^d Muller W. 0?Sfi),fABfOS ; 4:1 1- ^Robinson, 
20 E.D. (1971)^0^ Theor 77:105; Saitou, N. Nei, M. (1987) fl M>/. Biol. Evol. 4:406- 
425; SneaA, P.H.A, and Sokal, R,R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, Sap Franciscp, CA: Wilbur, W.J. and 
Lipman, DJ. (1983) Proc. Natl Acad.,, Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
25 conducted by the local identity .algorithm of Smith and Waterman (1981) Add APL. 
Math 2:482, by me .identity alignment ajgorjjlim.pf ^^Needlemp^d ^Vunsch (1970) J. 
Mol. Biol. 48:443, by*e search.^ (1988) 

' PrOC - Natl * cqd - * m.U^h 2444 ? . bv ..cqmp^epzed ^excitations of these 
algorithms <GAP, ^SJTITJBLAST, ,FASTA, ;? nd TFA^inithe Wisconsin Genetics 
Software package, Genetics C^puter Group (GGG),. 575 Science Dr., Madison, WI), 
or by inspection. 
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- , > ; r i •k mo ' +w are suitable for determining 
One preferred example of algorithms that are sunao 

- J - a - ntitv ^ sequerice similarity are the BLAST and BLAST 2.0 
percent sequence identity ana sequcx 9*^X9-3402 

^gorithms, which are describe* fc AUscbu. e. a,. (,977) NucL ™ <^ 

J, Altschul ct a!. (.'990)7. toi'** Si'**** respecuve.y. BLAST and BLAST 

i S can be *— ~ '° ~ oT 

'sequence iden.i.y ^ Uie po ly nuc,eo«des and po.ypeptide. of «he rnvenuo n. Sofrw- 

' 2 ^forming BLAST analyses „ pub,* available .hrongh .he >«■- ~ * 
' BioLhnoWgy infonnanon. For *mnb acid Sequences, a scormg.na.nx can be »d • 

10 ^ cun^e'aii^ scoW ^otf b, *e X tan - — 

" Svedvalu*^ 

more ncna.ive.scorine ^d»e or U,c end of:«*er science , 

one or more negaxi * determine the sensitivity and 

reached: The BLAST algorithm parameters * . T and . X detfcrm 

speed of the alignment. 5(W5(v i. i<5 

- one Purred appro&W the "percentag, of sequenced » 

" defined by' comparing .wo ojitoaUy ah f ned «,»ences over a^window of 

- Iparison window ma, comprise adding or demons ^ ^ 

' or .ess usuaUy 5 to .5 percen., of' .0 to' 12 percent, 'as compared ,0 me reference 
20 ' Z—JL*** - comprise addiuons or de.e.id.^op.^a.ig^en ofme 

" vvhichle identical 'amino' &d residue o*urs in bom sequences .o yreld me number of 
^sitions, dicing number of matched ^ * — - 

25 lOOtoWeldiheperienUgeofsequenceidenaty. 

Within other uiusuative embodiment a po.ypep.rde may be a 
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w lmin *j iii^a i"^" — 
^neic peptide that comprises an polypeptide having sequence 

^ed as a reference ^de, bu, WU ch xenogeneic polype . *n£ 

£ altoen. 1 non-human species. On, stilled in 'me ar, wiU recogmze ma. 
from a differenl, ^ T .lympT.ocvte responses, 

"self 'antigens are often poor stimulators ot ana 
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and therefore efficient immunotherapeutic strategies directed against tumor 
polypeptides require the development of methods to overcome immune tolerance to 
particular self tumor polypeptides. For example, humans immunized with prostase 

, pm,ei " fom a x r cndc huma " ) ^ arc capaw - 

5 response against the counterpart human protein, e.g. the human prostase tumor protein 

present invention provides methods for 
L purifying the xenogeneic ; form of , the tumor proteins set forth herein, such as the 
polypeptides set .forth in. SEQ ID Nps:235, 237, and 245, or those encoded by 
; polynucleotide sequenqej? set forth in SEQ ID NOs: 1 -234, 236, and 244. 
10, sl .,»; Therefore, one,, aspect of me present invention provides xenogeneic 

variants of the polypeptide .compositions described herein. Such xenogeneic variants 
generally encompassed by the present, invention wiil typically ^ exhibit at least about 
. 70%, 75%, 80%; 85%, 90%, 91%, 92%, 93%^ 94%,' 95%, 96%"' 97%,' 98%, or 99% or 
more identity along their lengths, to a polypeptide sequences set forth herein! 

Ivjore particularly, . Jhe invention is directed to mouse, rat, monkey, 
porcine and o^npnyhuman i^y^tides^which can be used as xenogeneic forms of 
hu^.^lyp^ptid.es^t.fQrOj heroin,, tp>duce immune responses directed against 
tumor polypeptides of the invention. : ' !r> 
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Within other illustrative embodiments, a polypeptide may be a fusion 
20. polypeptide thatcomprises multiple polypeptides as described herein, or that comprises 
at least one polypeptide as described herein and an unrelated sequence, such as a known 
tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer)' at higher yields than 
25 the native recombinant protein. Certain preferred fusion partners are both 
immunological and expression enhancing fusion partners.^ Other fusion partners may be 
, selected so as to increase, the solubility of the polypeptide or to enable the polypeptide 
to be targeted to .desired, in^acellular compartments. Still further fusion partners 
include affinity tags, which facilitate purification of'the poivoerjtide ' 

.. . Fusipn : polypeptides may generally be prepared using standard 
t techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
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expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide^ in an expression system! Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. "The 3' end of the DNA sequence encoding one 
5 polypeptide component is ligated, with or without a peptide linker, to the 5 1 end of a 
! D1^A sequence encoding the second' polypeptide component so that the reading frames 
' of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

hi ; i- A peptide linker sequence' may be employed to separate the first and 
10 second polypeptide components by a distance 'sufficient to ensure that each polypeptide 
; : folds into its secondary and tertiary structures?" Such a peptide linker sequence is 
1 incorporated into W fusion polypeptide Wing standard techniques well known in the 
art! Suitable peptide linker sequences may be chosen based on the following factors: 
n (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
15 ' secondary structure that could interact with functional' epitopes oh the first hand second 
polypeptides; and (3) the lack of hydrophobe or 'charged residues mat might react with 
^ 1 ' the porypeptide functional ep^opes^^Preirred" peptide linker sequences contain Gly, 
' Asn and Se r residues. Other near neutral amino acids, such as Thr and Ala may also be 
1 used in the linker sequence. Amino acid sequences which may be usefully employed as 
2 b linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., 
' Proc. Natl. Acad. Sci. USA 53:825^8262, 1986; U.Sl Patent No. 4^35,233 and U.S. 
1 Patent No. 4,75 1 ,1 80. The linker sequence may generally be from 1 to about 50 ammo 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 
25 separate the functional domains and prevent steric interference. 

' ' ; - The' ligated ' ! i>NA sequences are operably linked to suitable 
' ' transcriptional ' or translational "regilatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequence encoding 
' the ' first polypeptides. Similarly, stop codons ' required to%nd translation and 
30 transcription termination signals are only present y to the DNA sequence encoding the 
second polypeptide. 
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The fusion polypeptide can comprise a polypeptide as described herein 
together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl J. Med., 
5 53(5:86-91,1997). 

In one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment. Ral2 compositions and methods for their use in enhancing the expression 
, ■ , t n and/or inununogenicity of heterologous polynucleotide/polypeptide sequences is 
.,...-,.10 described in U.S. Patent Application 60/158,585, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 
and avirulent strains of M tuberculosis. The nucleotide sequence and amino acid 

15 sequence of MTB32A have been, described t (for example, U.S. Patent Application 
60/158,585; see also, Skeiky et al, Infection gnd Immun. (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the J 
purification process. Moreover, Ral2 may enhance the immunogenicity of heterologous 

20 immunogenic polypeptides with which it is fused. One preferred Ral2 fusion 
polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid 
residues 192 to 323 of MTB32A. Other preferred Ral2 polynucleotides generally 
comprise ^t least about 15 consecutive nucleotides, at least aboitf 30 nucleotides, at 
least about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, 

25 or at least about 300 nucleotides that encode a portion of a Ral2 polypeptide. , Ral2 

i * . ,l . ■ * * * * ► j * I * *" * ' h * ** 

polynucleotides may comprise a native sequence {i.e., an endogenous sequence that 
encodes a Ral2 polypeptide or a portion thereof) or may comprise a variant of such a 
sequence. Ral2 polynucleotide variants m^y contain one or more substitutions, 
additions, deletions and/or insertions such that the biological activity of the encoded 
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fusion polypeptide is not substantially diminished, relative to a fusion polypeptide 
comprising a native Ral2 polypeptide. Variants preferably exhibit at least about 70% 
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identity, more preferably at least about 80% identity and most preferably at least about 
90% identity to a polynucleotide sequence that encodes a native Ral 2- polypeptide or a 
"portion thereof. < t ■ ; ; ; / ? ; 

Within othercpreferred embodiments, an immunological fusion partner is 
5 derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
* ! influenza B (WO 91/1 892 6). Preferably, a protein 'D derivative comprises 
r approximately "the first third of the protein (e.gs, the first , i N-terminal 100-110 amino 
^acids)^ and a protein D ^derivative may bb -lapidated. Within certain preferred 
r ' embodiments, the first 409 residues of a Lipoprotein D fusion partner is included on the 
10 N-termirius to provide the polypeptide with additional exogenous T-cell epitopes and to 

* -increase the expression leVei in E % ^coli (thus functioning as an 1 expression enhancer). 
The lipid -tail ensures optimal presentation of the antigen, to antigen presenting cells. 

J - Other fusion partners include the non-structural protein from influenzae virus, NS1 
; (hemaglutinih). Typically, the N-tenninal 81 amino acids are used, although different 
15 fragments that include T-helper epitopes may be used. - ;v • • : i 

¥ 

In another embodiment, the immunologic^^ is the protein 

- ; 1 ^ portion therb^^ portion). LYTA is 

hi /u ^^a^rived ' from Streptococci ^neuntomde 9 v^lnl N^acetyl-L-alanine 

-■ £midase> known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 

» 

20 1986). LYTA is an autolysin that specifically degrades * certain bonds in the 
peptidoglycan backbone. The C-terimnal domain of the LYTA protein is responsible 
for the affinity to the choline or to some choline analogues such as DEAE. This 
property has been exploited for the development of E. coli C-LYTA expressing 
plasmids useful for expression of fusion proteins. Purification of hybrid proteins 
25 containing the C-LYTA fragment at the amino terminus has been described {see 
: Biotechnology 70:795-798, 1992). Within a preferred embodiment, a repeat portion of 
LYTA may be incorporated into a fusion polypeptide. A repeat portion is found in the 

* Oteririihal region starting- at residue 178. A 1 particularly preferred repeat portion 
=*• incorporates residues 188-305. M 

^30 :r Yet another illustrative embodiment involves fusion polypeptides, and 

tHe polynucleotides 7 encoding them, wherein 1 the fusion partner comprises a targeting 
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signal capable of directing a polypeptide :to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An immunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhancfed dn !vivp stimulation of CD4 + T-cells specific 

5 for the polypeptide. ; > a* r: ' ; ■ ^ l ! 

; Polypeptides of ihe invention ar^ pirepared using any of^ yariety of well 
: i : known synthetic and/or recombinant t^hmques^ the latter of which ; are further 
; < described below. Polypeptides, portions and Other variants generally less >than about 
o J50 amino acids can: be generated by/synthetic mean?, u^ng techiuq^iw?ll known to 
V' ,10 - those of ordinary* skill" .in- ..thelarti,-; In one t illustrative example^ such;: polypeptides are 
• :.• ■ ■<:, synthesized using any of the commeroiallylavailable solid-phase tephnique&such as the 
, Merrifield solid-phase synthesis method* wh^rq aniino spids.are sequentially added to a 
, : \ gnowingjamino acid chain. See Merrifield* Jl Am. Chem. Soc . 85& 146, 1963. 

Equipment for automated synthesis of polypeptides is commercially, available^ from 
15 suppliers such as Perkin Elmer/AppliedjBioSy^ms Division (Foster, Gity, CA), and 
rciovoiq . : . may be operated! according to the mfcnufacj^rer's iiastruptions. 

: / ; v > r ; fe^ (including fusipn ; p<?lypeptides) of 

,'t 

«. ■• ! vthe-inwntion. are isolated.! An "isolated" polypeptide is one that js; removed from its 

<■ > original environment. For example,; a naturally-occurring protein or polypeptide is 
-20 i isolated- if ,it is separated from some or all of the coexisting materials, in the natural 
systenk Preferably, such ^polypeptides- are also purified, e.g., are at least about 90% 
i ; T pure, more preferably at least about 95% pure.and most preferably at l.east about 99% 
\; pure. --.V ' ■ ' " v fs -'C'"'- ' 

Polynucleotide Composition > - i 

25 The present invention, in other aspects, provides polynucleotide 

compositions. The tenns- ' ! QNA"^and; ^olynuqjeotide"' are used . essentially 
; L , interchangeably herein to refer tq\a DNA molecule that has ; been isolated free of total 
genomic DNA of a particular species. "Isolated/ 7 as .used herein, means that a 
- 1: . pplynucleotide isrsubstanJially a^yay frQm ot^er coding sequences, and that the DNA 
: , .30 a molecule . does nqt, contain large potions of unrelated pqding DNA, such as large 
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chromosomal fragments or other functional genes or 'polypeptide coding regions. Of 
coursfe, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions later addecl to the segment by the hand of man; 

As will be Understood by those skilled in the art, the polynucleotide 
5 compositions of this invention can include genomic sequences, extra-genomic and 
plasmid-encoded sequences and smaller eiigirieered gene segments that express, or may 
' fee adapted to express/ protein^, jx>l>^eptides ? peptides and the like. Such segments 
may be naturally isolated, or modified synthetically by the Hand of man. 

AS will fee also fecbgrlized by the -skilled artisaii, polynucleotides of the 
10 invention may be Single-stranded (coding or iritisense) or double-stranded, "and may be 
T>NA (genomic, bDNA or Synthetic) or RNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to la DNA molecule in a one- 
to-one manner, and mRNA mofeeiites, which do not contain introns. Additional coding 
. < ■ i 

or non-coding sequences inay, but need not, be present within a polynucleotide of the 
15 present invention, arid elrpolyniicleoiidb may, but need not, be linked to other molecules 

an<W support materials^ -^..> ^rr<^ ,:. i. .urr> -:■■..< , h** 

Pbr^ucledtides hi^y ^o l ni^nsS J aT ftalfve leqtienfce 
sequence that encodes a polypeptide/p^dtein of th6 r inventibn dr' a poirtion thereof) or 
may comprise a sequence that encodes a Variant or derivative, ' preferably and 
20 inunundgenic variant "or derivative, of such Pa sequence. " ; 

Therefore, according to another aspect of the present invention, 
polynucleotide compositions are provided that comprise soine or all of a polynucleotide 

i 

sequence set forth in any one of SEQ ID NOs: 1-234, 236, and 244, complements of a 
polynucleotide sequence "set forth in any one of SEQ ID NOs: 1-234, 236, and 244, and 
25 degenerate variants of a polynucleotide sequence set forth in any one of SEQ ID NOs:l- 
234, 236, ! and 244. In certain preferred embodiments, the polynucleotide sequences set 
forth herein encode immunogenic polypeptides, as described above. u 

In other related embodiments, the present invention provides 

polynucleotide variants having substantial identity to the sequences disclosed herein in 

*\ * , 

30 SEQ ID NOs: 1 -534, 236, ahd 244- for example thdse cbmprising at least iWo sequence 
" identity, preferably at least '75%,' r 80%,' 85%, 90%; 95%, 96%, 97%; 98%- or 99% or 
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higher, sequence identity compared to a polynucleotide sequence of this invention using 
the methods described herein, (e.g., BL/±$T analysis using standard parameters, as 
described below). One skilled in this ait. \\flll recognize that these values can be 
appropriately adjusted tq determine coprpsponding identity of proteins encoded by two 
5 nucleotide sequences by taking inter apcoupt c^pn degeneracy, amino acid similarity, 
reading frame positioning ai)d the, like. ; . rc . , _ ; 

Typically, polynucleotide variants \yill contain one or more substitutions, 
additions, deletions and/or ipsertiofls, preferably, ,sq<?h thpt the iinmunogenicity of the 
polypeptide encoded ,by ^ substantially diminished 

r 1 P. h relative to a pqlj^ptide e^co<Jed fc>y a, pp^nuclpoti^ set forth 

herein). The tejpi "variants'! should also be ^d9rstoo,d|to encompasses homologous 

genes of xenogeneic origin,, ,, : f0 _ ; .,. A , ,r rM 

In .additional ., embodin>eirts, £be, .--^present inyeptipn . provides 
polynucleotide fragments comprising or consisting of various lengths of contiguous 
Stretches of ^sequence identical to, ox wmplemeiita^ to one or more of the sequences 
disclosed herein. For example, polynucleotides are; provided by this invention that 
., T oompri^ or ( cpnsi§t c$£. feapt §^y& 1^, 2%£0^5$, 75, 100, 150,200, 300,400, 
^5j00 or 1Q00 or more contigU9us ( jiu r cleotidps of one or more of th^ sequences disclosed 
herein as well as all intermediate lengths there between. It will be readily understood 
that "intermediate lengths 11 , in tjns context, means apy length between the quoted 
values, such as 16, 17 ? 18, .1% etc.; 21, 22, 23, etc.; 30 ?i 31, 32, etc.; 50, 51, 52, 53, etc.; 
1Q(], 101, 102, 103, etc. ; 150, 151, 152, 153, etc.; jncluding ^11 integers ^through 200- 
500; 500-1,000, and the like. A polynucleotide sequence as described here may be 
extended at One or both ends by additional nucleotides not found in the native sequence. 
This additional sequence may consist of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 

•■ : .U.' \ . .j i { —O . ' , . ' < ■ ; \ \tt l T : 1 ' "> > i " 

- i 

16, 17, 18, 19, or 20 nucleotides at either end of the disclosed sequence or at both ends 
of the disclosed sequence, , . 

In another embodiment of the invention, polynucleotide compositions 
; . are ; provided that are capable of hybridizing under moderate to high stringency 
conditions tp T a pol^nuclepjtide . sequence ^rpvided herein, .or, a fira^nen|, thereof, or a 
. complenientary. sequence* thereof.. Hybridization techniques are well known in the art 
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of molecular biologyt For purposes of illustration, suitable moderately stringent 
conditions for testing the hybridization of a polynucleotide of this invention with other 
polynucleotides include -prewashing iri ; a solution of 5 X SSC, 0.5% SDS, 1.0 mM 
EDTA (pH 8.0); hybridizing at r 50°C-60°C, 5 X SSC, overnight; followed by washing 
5 twice at 65°C for 20 minutes with each bf 2X, 0.5X and 0.2X SSC containing 0.1% 
SDS. One skilled in the art will understand thkt the' stringency of hybridization can be 
readily manipulated, such as by altering the salt content of the hybridization solution 
and/or the tdiriperature at which the hybridization is performed. For example, in 
another embodiment, suitable' highly stringent Hybridization conditions include those 

10 'described above, 'with the exception' that the temperature of hybridization i£ increased, 
; elg^ to 60-65°C or 65-70°C/ — '' : n ' : ^ - ' - : " 

" ,,J '•' ! In certain preferred embodimentSj the polynucleotides described above, 
e.g., polynucleotide variants, fragments and hybridizing sequences, encode 
polypeptides 1 that are immunologically crbss-reacfive with a polypeptide sequence 

15 ! - specifically set forth herein. In othdr preferred 1 ehibbdimerifs; such polynucleotides 
encode polypeptides tfrat 'have el 16vel 6f J immimbgehie activity of at least about 50%, 

: 1 pr^My at l^kst ^otA'70^/^ M^ prfefirabl^% Uast db6ur90% of that for a 

~ A pblyp^ptid^ sequence Specifically set fbrA herein.' ViA ' 

" * The polynucleotides of the present invention, or fragments thereof, 

it)*' regardless of the length of the coding sequence itself, may be combined with other 
DNA sequences, such as promoters, polyadenylati6h signals, additional restriction 
enzyme sites, multiple cloning sites, other coding segments, and the like, such that their 
overall length may vary considerably. "It is therefore contemplated that a nucleic acid 
fragment of almost any length may be employed, with the' total length preferably being 

25 limited by the ease of preparation and use in the intended recombinant DNA protocol. 
For exairi^lfej illustriativ^ 1 polynucleotide segments with total lengths of about 10,000, 
1 about 5000,' about 3000, about 2,000, about i;000,' about 500, about 200, about 100, 
about 50 b&se pairs in length, and the like^ (including all interinediate lengths) are 
contemplated to be useful in many implementations of this invention: 

30 When comparing polynucleotide sequences, tWo sequences are said to be 

"identical" if the sequence of nucleotides in the t<vo sequences is the same when aligned 
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for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed , by comparing the sequences qyer a comparison 
window to identify and : compare lqpal regioijs rf, sequence similarity. A "comparison 
window" as used herein, refers ^to a^s^gnient ofatf least at*out 20 contiguous positions, 
5 usually 30 to about 75, 40 to about 50, ip which a sequence n^ay be compared to a 
reference sequence of the same miml?€f of qoptiguous positions after ,t}ie two, sequences 
are optimally aligned. .. , * . ; v , 5 , ; . , . 

■ -. ■•_-<„ Optima^lignmeniof^ 
, ; the MegaHgn program in, the Lasergene siu'te of biojnforma^cs so^are ^NASTAR, 

: 1Q,., Jnc., Madison,. Wl^using default parameters. , -Thjs,^ program- emhodies several 
alignment schemes described in the following references: r^ayhoff, ,M.Q. (1978) A 
model of: evolutionary change^n proteins,- Matrices for detecting distant relationships. 
m,Dayhoff, M.O,.(ed.) Atlas of JProtein Sequence, and Structure^ Rational Biomedical 

,,. Research Found^on, Washington,^? Vol. 5, Suppl. 3, pp. 345-358;,Heifl J. (1990) 

X^ v : Unified Approach. ;to ; i^Jigi^wt^d Phylogene^pp. 626-645 ^Methods in Bnzymology 
T; f h vol 183, Aca^demk Press_, and ,J3harp, P.M. (1989) 

U :l QW™:te?M$& -WWk&Wa (W) CAJJQS ,4:11-17; Robinson, 

E.D. (1971) Comb. Theor 7,{:ip5- ^antou, 1ST. Nes, M. (1,987) ,^pL u BipLEvol. 4:406- 
42^; Sneath, P.H.A. and.Sokal, R.R. (1913).Numerical Taxonomy -the Principles and 
20 Practice of Numerical Taxonomy ? ,Freejnan Press, San Francisco, CA* Wilbiir, W.J. and 
, ■ Lipman, D.J. (1983) Proc. Natl. Acad, Sci. USA £0:726-730. 

Alternatively, , optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
. Math 2:482, by the identity alignment algorithm of N^edleman and Wunsch (1970) J. 
25 Mol Biol. 48:443, by the search for similarity meth<#is of Pearson ^nd Lipman (1988) 
Proa, m Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, .and TEASTA in.the Wisconsin Genetics 
♦r Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection., ...... r 

?Q : /:J ... Qne preferred exsqiiple of algorithms that are suitable for determining 

( r- , percent , sequence identity and sequence similarity are the BLAST and BLAST 2.0 
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: algorithms, which are described in Altschul et al. (\917).Nucl r Acids Res. 25:3389-3402 
and Altschul et al. (1 990) J. MoL Biol. :2 1 5 :403-41 0, respectively. BLAST and BLAST 
.2.0 can be used, for example with the parameters described herein, to determine percent 
' sequence identity for the polynucleotides of the invention. Software for performing 
5 , BLAST analyses is publicly available through the National Center for; Biotechnology 
• Informations In one illustrative example, cumulative .scores can be calculated using, for 
nucleotide sequences; the parameters Mt (reward score>fot a pair of matching residues; 
.. ; always 5*0) and N (penalty" score; sfor jrusmatching residues; . always <0). Extension of 
the word hits in each direction arehalted when: the cumulative alignment score falls off 
1 0J :by 1&e quantity X from its maximum, achieved; value; the cumulative* score goes to zero 
S. or below, due to the accumulation of one or. more negative-scoring residue alignments; 
■ ' ■'. or me end of either sequence is reached. .• The BLAST algorithm parameters. W, T and X 
.determine the sensitivity and speed of the. alignment. The BLASTS .program (for 
- nucleotide sequences) uses as defaults a wordlength (W) of 1 1 , and expectation (E) of 
15 10, and 1 the BLOSUM62 scoring matrix (see Henikoff and Kenikoff (19.89) Proc. Natl. 

Acad. Sci. USA 89:10915) alignments, (B) of 50,; expectation- (E) of 10, Mf=5, N=-4 and 
-r ora comparison of both strands, rv/oilc ?.>ss>ns>actyrr« ^^-^-^ 

Ui ^..rrvi.Dc^ » Preferably, the r "percentage of i sequence iiidentity" ,^is determined by 
• ,t« comparing two optimally .aligned sequences over a window of comparison of at least 20 
20 ' pbsitions, wherein the portion of the polynucleotide sequence in the comparison 
' window may comprise additions or deletions (i.e., gaps) of 20 percent, or less, usually 5 
to *1 5 percent, or 10 to 12 percent, as compared to the reference sequences. (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is -calculated by determining the number of positions at which the identical 
25 • - nucleic acid -bases occurs in both sequences to yield the number of matched positions, 
'dividing the number of matched positions by the total-number of positions in the 
reference sequence (i.ei, me. window si?e) and multiplying the results by 100 to yield 
- the percentage of sequence identity, o — - 

. l>.; : -. it will-be appreciated by those of ordinary skill in the art that, as a result 
30 of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
. a ; polypeptide as described herein. • Some of these polynucleotides ibear minimal 
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j homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 

that vary due to differences in codon usage are specifically contemplated by the present 
invention, Further; alleles of the genes ^comprising the; polynucleotide sequences 
provided herein are within the scope ; 6f the present invention. Alleles are endogenous 
5 genes that are altered as a result of one or more mutations, such as deletions; additions 
and/or substitutions of nucleotides; The resulting mRNA and protein may, but need 
not, have an altered! structure or function. .Alleles. -may be, identified lising standard 
techniques (such as hybridization^ amplification and/ox database sequence comparison). 
! ? - i : Therefore, in another embodiment of the ;inveritidn* a mutagenesis 

q; ^ 10- approach, such as site-specific mutagenesis^ is eeimployed v for the preparation of 
immunogenic variants and/or derivatives of ^tiae polypeptidfetdescribed h^ein. By this 
approach; specific modifications : in a polypeptide .sequence can; be made through 
mutagenesis Jof the underlying polynucleotides that encode rthem. Xhese »techniques 
1 provides a straightforward approach to prepare and test sequence variants, for example, 
. v 15 ^incoipbrating one or mtfre oftthe foregoing considerations, by introdutiiitg one or more 

Site-specific mutagenesis allows .the production of mutmt& through the 
in use; of specific- oligonucleotide sequences j which encode the DNA sequence of the < 
desired mutation, as well as a sufficient number of adjacent nucleotides, to-provide a 
< r 20 primer sequence of sufficient size and sequence complexity to form a stable duplex on 
both sides of the deletion junctioa being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
change -the properties of the polynucleotide itself, and/or » alter the properties, activity, 
f composition, stability, or primary sequence of the encoded polypeptide, - 
25 In certain embodiments of . , the preset, mAfentdon, , the inventors 

contemplate the. mutagenesis t of the disclosed [polynucleotide, sequences to alter one or 
more properties of the ! encbded- polypeptide, Siiph as. the imnaunpgenicity of a 
polypeptide vaccine. The techniques of siterspec£fic mutagenesis are well-kqown in the 
art; and are widely used to create variants of . bothi polypeptides and polynucleotides. 
■'.■30/ For example, site-specific mutagenesis is often used^to .felter a specific portion of a DNA 
rmdlecule. In such embodiments, a primer comprising, typically about 14 to about 25 
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nucleotides or so in length is employed, with about 5 to about 10 residues on both sides 

of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 

mutagenesis techniques have often employed a phage vector that exists in both a single 
5 stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 

include vectors such as the Ml 3 phage. These phage are readily 

commerciaUy-available and their use is generally well-known to those skilled in the art. 

bouble-stranded plasmids are also routinely employed in site directed mutagenesis that 

eliminates the step of transferring the gene of interest from a plasmid to a phage. 
1$ ' ' " in general, site-d^rcc3ed 'mutagenesis ' in' accordance herewith is 

performed by first obtaining a single-stranded vector or melting apart of two strands of 

' 3 a double-stranded i vector that includes within its sequence a DNA sequence that 

....... . . --, - : :>■ ' >., y-" :■. : .■(>■■ r !.-•■■■■ 

encodes the desired peptide. An oligonucleotide primer bearing the desired mutated 

"sequence is prepared, generally synthetically.^ This primer is then annealed with the 
15 sinele-stranded vector, and subjected to DNA* polymerizing enzymes such as E. coli 
polymerase I Klenow fragment,' in order to complete the synthesis of the mutation- 
bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original 
non^rnutated sequence and the second sixan'd bears the desired mutation. This 
heteroduplex vector is then used to transform appropriate cells, such as R coli cells, and 
20 clones are selected which include recombinant vectors bearing the mutated sequence 
arrangement. 

The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 

•25 which sequence variants of peptides and the DNA sequences encoding them may be 
obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated wim mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of lialoy eVa/., 1994; Segal) 1976; Prokop and Bajpai, 1991; Kuby, 1994; 

30 and Maniatis et al . ', 1982, each incorporated herein by reference, for that purpose. 
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As used herein, the term "oligonucleotide directed mutagenesis 
procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucieic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
5 signal, such as amplification. As used herein, the term "oligonucleotide directed 
mutagenesis procedure" is intended to refer 'to a ' process that involves the 
template-dependent extension of a primer molecule. The term template dependent 

V™?* refers to nucleic acid synthesis of an RNA or a DN A molecule wherein the 

sequence, of the newly synthesized strand of nucleic acid is dictated by'the well-known 

10 rules of complementary base pairing (see, for example, Watson, 1987)! " Typically, 
vector mediated methodologies involve the introduction of the nucleic acid fragment 
into a DNA or RNA vector, &e clonal amplification of the vector, and the recovery of 
the amplified nucleic acid fragment. Examples of such ^ memodologies are provided bv 
{ IJ. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

'f,. , ^■^?^ r ^°^} i .J OT the production of polypeptide variants of the 



present invention, recursive sequence recombination, as described in U.S. Patent No 
5,837,458, may be employed. In this approach, Iterative cycles of recombination and 
.screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. ' 



20 . v , r t . , In other embodiments of the present invention, the polynucleotide 
sequences provided herein can be advantageously used as probes or primers for nucleic 
acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
or consist of a sequence region of at least about a 15 nucleotide long contiguous 
sequence that has the same sequence as, or is complementary to, a 15 nucleotide long 
25 contiguous sequence disclosed herein will find particular utility. Longer contiguous 
identical or complementary sequences, e.g., those of about' 20,' 3<if 40, 50, 100 200 
500, 1000 (including all intermediate lengths) and even up to full length sequences will 
also be of use in certain embodiments. ^ 

The ability of such nucleic acid probes to specifically hybridize to a 
SO sequence of interest will enable them to be of' use in "detecting the presence of 
complementary sequences in a given sample.' However, otheruses are also envisioned, 
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such as the use of the .sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. r 

Polynucleotide - molecules having sequence, regions consisting of 
contiguous nucleotide stretches of 1 0-14, 1 5-20, 30, 50, or even of 1 00-200 nucleotides 
5 or so (including ' intermediate lengths as well), identicals .or complementary to a 
. ■ - polynucleotide sequence disclosed < herein, are . particularly contemplated as 
•'.,,« nybridization probes for use in, e.^.j Southern and Northern blotting; This would allow 
a gene product, or fragment thereof, to be analyzed, both in diverse cell types and also 
in Various bacterial cells." me total size of-fragment, as well as the size of the 
10 -^omplbiieritaiy stretches), will ultimately (depend omthe intended use or application of 
^partTcuIaf^u^eic-:acid,segment .£• .SfflrfleflftagBieirfs.. will ..generally find use in 
' ' ^hybridization 'embodiments, wherein the length of the contiguous complementary 
region may be Varied, such as between about 15 and about 100 nucleotides, but larger 
; contiguous complementarity stretches may be mused, according to the length 
15 complementary sequences- one wishes to detect'- . '= > .•'>-'» nvo v " i ! 

V ; The use of a hybridization^probe: ofeabout! 15-25 nucleotides in length 
^ f -alldws me forrhation© is.;bdm,sia&18<and selective. Molecules 

' ™*hkffi& Contiguous complementary' sequencesiover stietches greater; man 15 bases in 
length are generally preferred, though'; in order to increase stability and selectivity of 
20 — the hybrid, and' thereby improve the quality, and degree of specific hybrid molecules 
•obtained. One will generally prefer to design nucleic acid molecules having gene- 
' ^ complementary stretches of 15' to 25 contiguous nucleotides, or even: longer where 



desired. 



■ < : - Hybridization probes may be selected from any portion of any of the 
25 : sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continudus portion of the sequences, from about 15-25 nucleotides in 
length up to arid including the full length sequence, that one wishes to utilize as a probe 
- or prirrieW The choice i'ef probe and primer sequences may be .governed. by various 
factors'. For example, One may wish to employ primers from towards the termini of the 
30 total sequence. Ji u 
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Small polynucleotide segments or fragments may be readily prepared by, 
for example, directly synthesizing the fragment by .chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic add reproduction technology, such as the PCR™ 
technology of Ui S. Patent 4^683,202 (incorporated herein by reference),, by, introducing 
selected sequence* dnto recombinant vectors, for recombinant production, : and by other 
recombinant DNA techniques. generally knownito those of skill in thwart of molecular 
■ -.biology. •..;( > ,• ■ . .■ ... . ... , • ,. 



; ; ' : TTie .nucleptide sequences of the invention may be used fpr their ability 

•• % ■ i ! 0 ; to selectively fbrmiduplex molecules with complenaentary stretches of the entire gene or 
gene fragments of; interest.- Depending on the? application enyisipnedj one will typically 
v , . desire to employ varying conditions of hybridization :to achievevy^rying, degrees of 
g J ' selectivity of probe, towards r, target . sequence, -F<*r .^pIi«it[o^s..4^i^ring Jiigh 
selectivity^ que twill typically desire to: employ, relatively stringent conditions to ; form 
15 the hybrids, e.g., one will select ^relatively low salt and/or high temperature conditions, 
rfcg/i&: >u such as provided by a:salt/CoiieeptratiQn : off#-Qm i about 0.02 M to about 0.15 M salt at 
Mj '■ temperaturear.ofe.ffrom.abbuti S09Q \to-f&mgt,-W?£. . Such; se |ectiye ; conditions tolerate 
; i ? little, if any^ ntisrriatch between ; the:!piobe : and the template or target sjrandjjand . would 
i sbe particularly suitable for isolating related, sequences.^ , , , 
20 *•" ■ ".Of course, for, some applications^ for example,, where lone desires to 

prepare . mutants employing ■ a- mutant primer strand hybridized to ao.,underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of fronj. about 0. 1 5 M to about 0.9 M 
25 salt, at temperatures ranging from about 20°C to about £5.«?C. <>Qss-hybridizing species 
can thereby be readily identified as positively hybridizhagsignals with respect to control 
hybridizations. , In any case* it is generally ^ppreciatedj that conditions jcan .jbe rendered 
more stringent by, the^addition -of i^cfeasing^ounts . pf.fojinainide, -which serves to 
destabilize .the hybrid duplex in. the,; samemann^aste Thus, 
30 hybridization conditions can be readily manipulated, and thus ,will, generally be a 
method of choice depending on the desired results. 
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According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, 4 and, cbnsequently, provide a therapeutic approach by 
5 which •a'disease cah be treated by inhibiting the synthesis of proteins thdt' contribute to 
Wdisease. The efficacy bf antisense oligonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalacturonase and the muscarine 
type i'-a^liAoliite'rec^r'te;^^^^ antisense oligonucleotides directed to 
their respective mRNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
10 ' ^urthCTi'^am^les"^ an*sense inhibitiori have been demonstrated with the nuclear 
• p&teffi cyc^n, the multiple drug resistance gene (MDG1), ICAI^li E-selectin, STK-1, 
;b "4triltSf GABAa receptor and human EGF (Jaskulskf et al, Science. 1988 Jun 
"10;240(4858): 1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et'al., Brain Res Moi Brain ReS.' 1998 Jun 15;57(2)-3 10^20; U. S. Patent 
15 5;80T,1 54; "U.S. Patent 5,789,573Tu. S: Patent 5,718,709 and U.S. Patent 5,610,288). 
v " Atttisense Constructs have also'beeii described''that in^iblf and can be used to treat a 
-Variety of abnormal cellular p^lif^rls;-^ canW^S. Patent 5,747^470; U. S. 
Patent ^^9X317 and U. S. Patent ^78^683^ ^ ' '' " ' 



1 ; 1 Therefore, in certain embodiments 1 , the present invention provides 
i0 oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
? capable of specifically binding to 'polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, 1 the antisense oligonucleotides comprise 
DNA of derivatives thereof; In another embodiment, the oligonucleotides comprise 
RNA or derivatives therteof. In a third embodiment, the oligonucleotides are modified 
2$ DN As comprisirig a phbsphorothioated modified backbone. In a fourth embodiment, 
' Ae : 6iig^nucl'eo^ Se^e : rices -comprise peptide nucleic acids or derivatives thereof: In 
each cose; preferred com'positidns comprise a s^ that is complementary, 

iid' more preferably ' 'sub^tially^mplem-entaiy; and even more preferably, 
completely c6m : pierrientary to one or mbfe 'portibn's of polynucleotides disclosed herein. 
30 SeiectioiP of arkisense compositions 'specific for a given gene 1 sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, T m , 
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binding .energy, and relative stability, Antisense compositions may be selected based 
upon their ; relative, inability to fonn djmers, hairpins, or other secondary structures that 
i ™>uM reduce or prohibit specific finding ,tp the t^f f mRNA t in a host cell. Highly 

, ; . preferred target regions of the mRNA^ are those which are at or near the AUG 
f 5 translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA. These secondary structure, analyses and target site selection 
. considerations can be performed, for example,, using v,4 of the . OIJQC/ primer analysis 
#v /a oSoftware^d^r.^e.B^^TO.j2.0.5 3^^,39^^.(411^ fyal, Nucleic Acids 
^s.1997,25(17):338^402).." ... . ,. , . r . ' '.\ 

i: ^>i^° ■ ' : m ?Tk&,JB£J(t ^.>fp$sense t 4eUy^ ; ^e^od,.^lQ^iiig. ^..^prt peptide 
v ;i t ; vector, termed MPG (27 residues), is alsp ; contemplated, Th,e M?G. peptide^ contains a 
hydrophobic dornajn deriyed froirifthe fusion sequence pf HIY.gr^l and.a hydrophilic 
. .: . < domain from the. nucleay ocaHzation sequence of Sy40 T-antigen (Mprris et ah, 
,.0/.. . ■< Nu 9 lei c Acids Res. 1997 Jul, 15;2 f 5(14):2730-^). It has been demonstrated that several 
r: ) ^ ; M 0t JnQlecules of the MRG peptide cgaj the antjsense oligonucleotides and can. be. delivered' 
1 3f&-u et ^twedj^in^^ian,^ jttian J...hour with relatively high efficiency (90%). 
••• ■ '' ../Ci further, . the, inte^ti^wit^ Stability v pf the 

oligonucleotide to nuclease and ^e ability to cross the plasma merflbrarje. r . . .^ 

. Accordmg tQ u another embodiment of the invention, the polynucleotide 
20 compositions described herein, ^re used Jn the design , and preparation of ribpzyme 
molecules for inhibiting expression pf the tumor polypeptides and, proteins of the 
present invention in tumor cejls, Ribozymes are RNA-protein complexes that cleave 
nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proe Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;4?(2);2 11.20). For 
- example^ a large number of ribozymes accelerate phosphoester transfer reactions with a 

hi ^ h df»« e . of |pecifici^y, often, ; deaving pnly one of several phpsphoesters in an 
oligonucleotide substrate (Cech e( al . Cell. 1981 ,Dec2.7(3 Pt 2):4S7-96; Michel and 
,. ; Westhof, J, Mpl Bjp^ 1990, Pec 5;2 : 1<K3):585-61Q^^ Shub, Nature. 

• ■ 10 ; ?? M a7 .H;357(6374): 173-^6). This specificity has been arh-ibuted to, the requirement 
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that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

' Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hvdrolysis of RNA phosphodiester bonds in trans (and 
5 thus can cleave other RNA molecules) under physiological conditions. In general, 
enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
TTius, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
10 complementary base-pairing, and once bound to the correct site, acts enzymatically to 
" cut the target RNA. Sttategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 



W 7 ' ' fhe enzymatic nature of a ribozyme is advantageous over many 



technologies, such as antisense tecbuo^gy (w^e a imcleic acid molecule simply binds 
1 " to f nucleic acid f target to block its 'translation^ since the concentration of ribozyme 
" necessary to affect a therapeutic treatrhent' is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of the ribozyme to act 
20 enzymatically. Thus, a .single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
25 ribozyme. Similar mismatches in antisense molecules dp not prevent their action 
(Woolf etal. Woe Natl Acad Sci US A. 1992 Aug 15;89(16):7305-9). Thus, the 
' : specificity of action of a ribozyrne is greater than that of an antisense oligonucleotide 

binding the same RNA site. . ... 

The enzymatic nucleic acid molecule may be formed in a hammerhead, 
30 hairpin, a hepatitis 8 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
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described by Rossi et al. Nucleic Acids Res. 1992 Sep 1 1;20(17):4559-65. Examples of 
hairpin motifs are described by Hampel et al! (Eur. Pat. Appl. Publ. No. EP 0360257), 
Hampel and Tritz, Biochemistry 1989 Jun i'3;28(12):4929-33; Hampel et al, Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 
hepatitis 8 virus motif is described by Perrotta and Been, Biochemistry. 1992 Dec 
1;31(47):1 1843-52; an example of the RNaseP motif is described by Guerrier-Takada 
et al, Cell. 1983 Dec;35(3 P^' V 2):849-57; Neurospora VS RNA ribozyme motif is 
described by Collins (Saville and Collins, Cell 1990 May 18;61(4):685-96; Saville and 
Collins, Proc Natl Acad Sci U S A. ^ and Olive 



10 Biochemistry. 1993 Mar 23^1 1):279^ of the Group I i„tron is 

described in (U. S. Patent 4,987;67 j)! Alfthat'is important in an enzyinatic Nucleic acid 
molecule of this invention is that it has a specific substrate' finding" site which is 
complementary to one or more of the target gene RNA regions, and' tfiat it have 
nucleotide sequences within or 'surrounding' that substrate binding site which impart an 
RNA cleaving activity to the molecule. 3 Thus ihe ribozyme constructs need not be 
limited to specific motifs mentioned [ herein. '' 1 

Ribozymes may be designed as described in Int Pat.' Appl. Publ No 

t 6 J 3/23d ^ . R U 94^25^ eacii^ecifically 
incorporated herein by reference) and 'synthesized to "be tested ht vitro and 'in vivo as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 
ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl.' Publ. No. 
WO 92/07065; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl Pub). No" WO 
91/03162; Eur. Pat. Appl. Publ. No. 92il0298.4; IL S. Patent 5,334,711; and Int. Pat. 
Appl. Publ. No. WO 94/13688, which describe various' chemical modifications that can 
be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 
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Sullivan et al. (Int, Pat. Appl. Publ. No. WO 94/02595) describes the 
. general methods for delivery of enzymatic RNA molecules Ribozymes may be 
administered to cells - by a variety of methods known to those familiar to the art, 
including, but not restricted^, encapsulation in liposomes, s by iontophoresis, or by 
5' incorporation mto other vehicles, such as hydrogels, cyclodextrins, biodegradable 
. . . . nanocapsules,and bioadhesive microspheres, For some indications, ribozymes may be 
directly delivered ex vivo Ao ^cells . or tissues with or without the aforementioned 
u • . ; veMctesi Alternatively,./^ >RN A/vehicle combination may be locally delivered by 
' direct inhalation, by direct injection or- :by ; use, of a , catheter, infusion pump or stent. 
10 ! Other 'routes of delivery- include, but . are not limited to; intravascular, intramuscular, 
subcutaneous or joint injection; aerosol .inhalatiom oral, (tablet or pill: form), topical, 
-^st^^o^^tetr^erhoaeal ahd/br intrathecal delivery. - t More detailed descriptions 
. of ribozyme^delivery and administration are provided inlnt. Pat. Appl. Publ. No. WO 
- . ' 94/02595 and Int. Pat. Appl! Publ. No. WO 93/23569, each specifically incorporated 
15 ' herein by reference:^ - ^ ' " 

1 u .... < Another mean's, of accumulating high cpiKentrations of a ribozyme(s) 

: > vector^ Transcription of the ribbzyme- sequences are driven, from a promoter for 
<■« eukarydtic RNA polymerase I (pol 1), RNA pplymerase II (pol D), or RNApolymerase 
20 ffl (pol III). Transcripts from pol H or pol III promoters will beexpressed at high levels 
, in' all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences • (enhancers, silencers, ^c.) present nearby. 
• Prdkaryotic 'RNA polymerase* promoters may also, be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate ceUs Ribozymes 
25 ^ expressed from such promoters have been shown to function inmammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
^ ^ mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
i<l ■:< vectors (such as ao^novirus oradeno^associated vectors)^. viral RNA vectors (such as 
' ■-: rerrovfral, serriliki fbrest vifus; ; sindbis virus vectors). , ^ 

30 v : ' - u another embodiment of the invention, peptide nucleic acids (PNAs) 

, . -compost * & 
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attached'to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 
Dev. 1997 7(4) 431-37). PNA is able . to be utilized! in , a number methods that 
traditionally Have used RNA or DNA, .Often PNA sequences perform better in 
techniques man the corresponding RNA or DNA sequences and.have utilities that are 
5 not inherent.; to- RNA or DNA. A review -, of, PNA ■■• including , methods . of making, 
characteristics of, and methods of using, is provided! by, £orey (Trendy JSiofechnol 1997 
Jun; 1 5(6):224-9). As such, in certain embodiments, on© rhay prepare PNA sequences 
' that are complementary to one or more portions ofthe l&CU mRNA sequence, and such 
' PNA compositions may be used to regulate, waiter,, decrease, oj reduce! the. translation of 
W ACE-specific mRNA, and-thereby alter ;the.level.Qf AGE;actiyity jn a hoatcell to which 
r ;! such PNA compositions have beea^administered.:?; f ; t , -oi ; , : -j. 

■ V ' PNAs have. 2tamirioethyl-glycine < linkage? : : replacing ,. the normal 

phosphodiester backbone ;©f DNA (Nielsen et ah, Science .^X^^c. >6;2j54(5037):1497- 
c /500; Hanvey et. al, Science: 1992'Nov 27;258(^087);148;l-5; Hyrup and Njelsen, 
15 . Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemistry .has- three important 
consequences: firstlyi, ifficbntrast to DNA or phosphorothipate oligonucleotides, PNAs 
n ' ' ; aire neutral molecuie^:«eai«ieHy,- PNAs^ate achirjtf, which avoids, the need to develop a 
stereoselective • synthesis; and othirdly, PNA , synthesis uses standard^, ©oe or; Fmoc 
protocols for solid-phase peptide ! synthesis, although; other jmethods, ; ^including a 
20 modified Merrifield memod,vhavie been used.,, y c 0 

' PNA monomers: or ready-made, .oligomers are , commercially available 
from PerSeptive Biosystems (Framingham, MA);*. PNA syntheses, by either Boc or 
Fmocjprotocols are straightforward using manual or automated protocols (Norton et dl. , 
■«-■ Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol Jends itself to the 
25 production of chemically modified PNAs or the simultaneous .synthesis of families of 
closely related PNAs. . , , . .. , .... .,. . . 

-i As with peptide synthesis, the success of a particular 1?^A synthesis will 
depend on the properties of the chcKsen , sequence. rFor exampje, while ; in theory PNAs 
can incorporate any combination of nucleotide base§, the t prssencejof adjacent purines 
30 T can>aead to^ deletions . Of one or more residues in the product In expectation of this 
^difficulty, it is ^suggejsted ithat^ in producing PNAs; with adjacQjpt purines, one should 
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repeat the coupling of residues' likely to be added inefficiently. This should be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 

providing yields and purity of product similar to those observed during the synthesis of 

., , , . / ••• ■ . • •:'< ■ - ■ '' ' ,■'.>! 

peptides. 

5 1 Modifications of PNAs for a given application may be accomplished by 

coupling amino acids during solid-phase synthesis or* by attaching compounds that 
contain a carboxylic add group to ^ me exposed N-tenninal amine. Alternatively, PNAs 
can 1 be modified after synthesis by coupling to an introduced lysine or cysteine. The 
: ease with which PNAs can : be 1 ihb<tt5^d"1Si<^itates optimization for better solubility or 
W'^&i specific functional requirement: ' 'OAce'synlfiea^d, the identity of 'PNAs and their 
0 b derivatives can be confirmed . by mass "spectrometry; J Several studies have made and 
^' utUirod r modificatibns^ ^ of : Pr^As'<f6r example/ Nortbh et dl:, Bidorg MedChem. 1995 
!t ' Ap'r;3(4):437-45; Petersen et al, J Pept Scil 1995 May-Jun; 1(3): 175-83; Orum et al, 
'■■ Biotechriiquds. 1995 Sep;19(3):472-80; Footer ei al, Biochernistfy. 1996 Aug 
15 20;3 5(33): 10673-9; Griffith ei al, Nucleic Acids Res. 1995 Aug li;23(15):3003-8; 
i ^Wdridge 1 ^ al, Proc atl ; Acad'" &3 U^'A^i'^ Jun 6#2(12):5592!-6; Boffa'e/ al., 
Ml Pr6c Natl" Acad Scr 0 S AL 1995 ^^^4^320^901-5^ j Gambacolru-Passerini et al, 
;S " BfeoU ( 199o Au^ Natl Acad Sci USA. 1997 

J Nov 11 ;94(23): 12320-5;" Seeger et a/./kiotechniques. 1 997 Sep;23(3):5 12-7). U.S. 
20 l Patent Nd. 5,700,922 discusses PNA-DNA-PNA chimeric mblecules and their uses in 
' - diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. • • «' 

Methods\>f characterizing "the antisense binding properties of PNAs are 
discussed in Rbse (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al 
25 - : (Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
•determine binding of PNAs- to their complementary oligonucleotide, measuring the 
: ' ^relative binaing kinetics and sioiehiometry. Similar types of measurements; were made 

by Jer^&nVai usm'^BIAcdire^'techrlolo'gy/ 
t • ■ r - : other"' applications of PNAs that have been described and will be 

30 "■ apparent to the stalled artisan include use in DNA strand invasior^ aiitisehse inhibition, 
v mutatibiial analysis, enh^ acid purification, isolation of 
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transcriptionally active genes, blocking of , transcription factor binding, genome 
cleavage, biosensors, in situ hybridization, and the lijce. 

Polynucleotide Identification, Characterization and Expression 

. , .> 

Polynucleotides -compositions of the present invention may be identified, 
prepared and/or manipulated using my of a yapety^of \yell established techniques (see 
; v generally, Sambrook et £*L, Molpculgr Cloning: A Laboratory Marmot, Cold Spring 
^Harbor Laboratories, Cold Spring, Harbor^ NY, 1989,, and other likp References). For 
example, a polynucleptide may b^jjdeptifijed, .a$ described^ by 
r . ; ^eejxing a micro^ay pf ^ that 
10 . is ajt least two fpjd .greater \n t a tympr^han in^n^al tissue^as detepnined using a 
representative assay provided herein).. Such screens may be peiforn^d t? ; for example, 
( using the npicrozujay t^chjio^gy of Affymetrix, Inc. (Santa Clara* CA) ^ according to the 
, manufacturer's ^instructions (and essentially r as,- described by Schena.et al. ? Proc. Natl 
^, Acad <ScL L^^3:l0614rlp619,,l?96 and Helle^,et : ^.,,Pw^ Nptl^AcafL Sci., USA 
JS ^¥:215^ polynucleotides may be amplified from cDNA 

, , ? prepared f^qpi, pjsjls^x^^ syiqh a§|yinqc cells. ^ 

p ;■ t Many template d^j)^^^ent r proG€jSses are ayail^ble^to ^lplify a target 

sequences of.ii^t^est present in a^s^mple. One of the best known amplification methods 
is the polymerase chain reaction ,,(PCR™) which is ; described in detail in U.S.- Patent 
20 , Nos. 4,683,195, 4,683,202 and, ,4,800, 159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are? prepared which 
are complementary to regions ( on opposite complementary strands of the target 
sequence. < An excess of ^eoxypucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Tag pplyiperase). If the target sequence is present 
25 in a sample, the primers \yill bind to- the ^rget ^nd ; thq polymerase will cause the 
.... primers to be extended along r the target sequence by ^d^Jmg. on nucleotides. By raising 
and lowering the temperature of the re^ctipn mixture, the. extended primers will 
dissociate from, the target to fomi^reaptioix prpducts, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
3Q , .transcription and, PCR™ amplific^tio^procediire maj/ be performed Jn order. to quantify 
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the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. : ; ,v/-,~ 

Any of a number pf other template dependent processes, many of which 
are variations of the PCR ™ amplification technique, are readily known and available in 
5 tfie art. Illustratively, some such methods include the ligase chain reaction (referred to 
[ ; as LCR), described, for example, in Eur. Pat Appl. Publ. No. 320,308 and U.S. Patent 
i No. -:4,883,750; Qbeta Replicase, - described in. PCT Intl. Pat. Appl. Publ. No. 
q ; PCX/US 8 7/008 80;- Strand Displacement Amplification (SDA) and - Repair Chain 
: Reaction (RCR). Still other amplificatipn methods are described in Great Britain Pat. 
^Or,; -Appl.,No:: <2>/2Q2 r328,, and;i» ,PCX..totl/Pat.; Appl.. Publ. No, PCTAJS89/01025 OiOfher 
i ; v; nucleic acidamplification procedures include trajiscription^based amplification systems 
; j i (TAS) (PCT totl. Pat. Appl. PubltNoi WO 88/10315), including nucleic acid sequence 
s based amplification (NASBA) and 3 SR. Eur, Pat. Appl, Publ. No. 329,822 describes a 
.., nucleic acid * amplification process involving ; cyclically synthesizing singje-stranded 
15 /RNA ("ssRNA"), ssDNA, and ' double-stranded 7 DNA (dsPNA). PCT Intl, Pat- Appl. 

Publ. No.; WO 89/06700? describes a nucleic acid sequence; amplification; scheme based 
/ r/ on? the; hybridization of a pfomoier/prfmer sequspce to a. target single-stranded DNA 
■"H o ^ssDNA 11 ) ^followed by transcription of many RNA copies of the sequence. Other 
: ; i amplification methods Such as "RACE" (Frohman, 1990), and "one-sided PGR" (Ohara, 
20 ;1 989) are also well-known to those of skill in the art. - / ; 

v An amplified portion of apolynucleotide of the present invention may be 

vised Jto isolate a full length gene from a suitable library (e.g., stumor cDNA library) 
using well known techniques. Within such techniques, a library (cDN A or genomic) is 
i screened using one or more.polynucleotide probes or primers, suitable for amplification. 
25 ; . Preferably, : a library is size-selected to include larger molecules. Random primed 
: libraries toay ,also be preferred for identifying 5 1 and upstream Tegibns of genes. 
- Genomic libraries are preferred for obtaining introns atnd extending 5 1 sequences. 
..•*,■• -,',r : For hybridization techniques, a partial sequence may be labeled (e.g., by 
nick-translation or end-labeling with 32 P) using> well known techniques. A bacterial or 
30 bacteriophage -library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) witja the labeled probe 
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{see Sambrook <*t ah; Molecular Cloning: A Laboratory Manual Gold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or 'plaques are 
selected and e&prinded, and the DMA is isolated for fiirther analysis. cDNA clones may 
be analyzed to deteJrmine the amount of additional sequence by, for- example, PGR 
5 ( using a primer from the partial sequence^ and a pnmerl&om the vector. 5 Restriction 
maps and partial sequences may be generated! to identify one- ^r riiore overlapping 
clones. The complete 'sequence^May then -be determined using standard techniques, 
which may involve "generating a series of deletion clones; The resultiilg: overlapping 
" i/' fG Sequences can then assembled irfto a single contiguous sequence ' A frill length cDNA 
g *10 ^ mdlecule* can be^ene&tSd fejrKgatirig suJtable 'fragt^ents, usiiig -Well *novyn:techritiques. 
' 1 J : ' . ; A^eiiativelyp amplification; tec*miques,^ such: as^ those described above, 
^ ; .-' } . ' c^n be useful for obtaining a full letfgth? coding \ sequence 7 fram'i a., partial cDNA 
sequence. One sucli ^amplification technique ii ^itiVe^se■F€R■(i5^:Triglk:.et al., M/c/. 
y Acids Res. 7^:8186,1988), which usesirestriction enzymes to generate a fragment in the 
15 : * known rejgioii of ^efeene. IThejfragmeni is then circularized by intramolecular ligation 
b^o t ( andmseid a$ ia template^ P£R with divergent primers derived from the known region. 
o'^U uor* Within, an alternative ^jsptQaeh, ? sequencer adjacent to a partial sequence may be 
•ft > c retrieved by amplification witb a primer to a linker sequence and a prhner . specific to a 
! known region. The amplified Sequences , are typically subjected to a second round of 
20 amplification with the same lihker primer and # second primer specific to the known 
a region. ' A -variation on this procedure, which employs two primers that initiate 
extension. in opposite directions from the .known ^sequence, is des6ribed in WO 
f 96/38591. Another such .technique is known as "r^pid amplification, of cDNA ends" or 
RACE.? This technique! involves the use of an internal primer and an external primer, 
25 which hybridizes to a polyA region or vector sequence, to identify, sequences that are 5* 
i and 3 ! of a known sequence. ^Additional techniques include* capture fCR (L&gerstrom et 
al., PCR Methods Apptic. , 1 :VU 49^1991 ) arid walking PGR (Parker et al . , Nucl Acids. 
Res. 79:3055-60, 1991). Other - methods < employing amplification may also be 
. .employed to, obtain a full length cDN A "sequencer fc V ; > l t s ,< . 
30 { -In certaih instances, it is pos^ 

u.: by analysis 6f;sequences:provided in am expressed sequence tag (EST) database, such as 
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that available from GenBank. 1 Searches for overlapping ESTs may generally be 
performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used fb' generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. ' 
5 ' lu ; other 'embodiments "bf me invention, polynucleotide sequences or 
' fragments thereof which encode polypeptides of the invention; or fusion proteins or 
' ""fun^orial ? 'equiv&leks' ra6reof, 5 niay'W used in recombinant DNA molecules to direct 
; expression Of a polypeptide in 'appropriate host cells. Due to the inherent degeneracy of 
! the 'genetic i! cbde, othei' DNA sequences that 1 ehcbde substantially the same or a 
ltS : " f&toi^oikdly' eqi&i^alerit 1 ifikhiiib a acid 's^uedoe may be produced and these sequences may 

1 : be : \iSed t6 "cldiie 'itod'eipfress a given polypeplidb; " - ' 
■y :in':± :v K v: \, »e ^jj ^ understood by those of skill in the art, it may be advantageous 

\ in some instances to produce polypeptide-encoding nucleotide sequences possessing 
non-tiataralty occurring codons. iFor example, codons preferred by a particular 
15 prolcaryotie or eukaryotic host can be selected to iricfease the rate of protein; expression 
; ! 6r to produce a reconibinaHt fcNA ! transcript having ' desirable properties, such as a half- 
W^M^ ? ^\6ngjsr man ( uia¥ ] df « l Mi^^^^i^''^ia-ibc naturally occurring 
sequence. 

* r ' ' "' Moreover, ihe polynucleotide sequences? J of the present invention can be 

20* engineered using methods generally known in the art in order to- alter polypeptide 
encoding sequences for a Variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
example; DNA shuffling by ' random fragmentation and PCR reassembly of gene 
; ; fragments and synmetifc 'oKgonucleotides may be used to engineer tiV nucleotide 
25 : 'sequences. In addition!, site-directed mutagenesis may be used to insert new restriction 
1 sites, alter' giycbsyla^oh^patteinV^ang e codon preference^ produce splice variants, or 
introduce mutations, and so forth. 
■; .: i: i n : another embodiment * of" the * * invention, 4 natural, -modified, or 
; recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
3(J Encode a fusion "protein" For example, to screen 1 peptide libraries for inhibitors of 
v ' : ' ; polypeptide activity, it may be usefur to encode a chimeric J protein" that can be 
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,,, recognized by ^ ; commerciaUy available antibody. A fusion protein may also be 
engineered to contain a cleavage site located between the polypeptjde-encoding 
sequence and ; the heterologous protean sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. , 
5 Sequences, encoding a desired polypeptide may be synthesized, in whole 

pr in part, using chemical method^ well knOjWn in die art (see Qaruthers, M. H. et al. 
- (1989) Nucl. Acids Res. SymjK ^,2^23, Horn, T,, et . al. ,(1 §80) . NucL Acids Res. 
. . ■ 7 Symp. Ser. 225 r 232). ; Alternatively, ^protein itself may ? b^ prodded using chemical 
, , -methods t Q; synthesize . Jl^. ai^ino aci£,sequence pf a polypeptide, or a portion thereof, 
example, .peptide svnthesis r ^an be.perfoimcd; using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) &^«^,,^P:202,204) and automated, syndesis may be 
, . achieved, for .example, i ^iising.tlie AB^431A Peptide^ Synthesizer (Perkin Elmer, Palo 
... AJto, CA). . v . . ; 



A ne ^y synthesized r . peptide may, be substantially purified by 
^ ^Wepai^v^^high.jp^o^anc^^uia .chromatography Je.g., Xreighton, T. (1 983) 
„ .Proteins, Stnicture| i ^dMplecula^ principles,, TO ^eman and Co., New York, NY) 
or.other. comparable , tecb^ue^ mailable. in. the art. The composition ,of, the synthetic 
peptides may be confirmed by amino acid analysis or sequencing (e.g.,jhe Edman 
degradation procedure). Additionally, the amino acid sequence of a polypeptide, or any 
20 , part thereof, may be altered., during direct synthesis and/or ,Combine.d using chemical 
methods with sequences from other proteins, or any part thereof, to produce a variant 

J ■ ' i "id.: 

Polypeptide. . 



In order to express a desired polypeptide, the nucleotide sequences 
, encoding the polypeptide,, or functional equivalents,, may be inserted into appropriate 
25 expression yector, i.e., a vector .which contains the necessary elements for the 

Qpding, sequence. .Methods which are well 
known to those skilled in the art may be used.tq construct expression vectors containing 
■ sequences encoding , a polypeptide ,of inter^t and appropriate transcriptional and 
- translational control elements, These methods include in vitro recombinant DNA 

' ■'*■ ■ - i K .-.i , u- ..lit 

10 • techniques^ synthetic techniques, and, in vivo genetic recombination. Such techniques 
...,are described, f^r example, in. t Sambrqok, J. et al. (1989). Molecular Cloning, A 
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Laboratory Manual^ Cold Spring Harbor Press, Plainview,„N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley . & Sons, New York. 



N.Y. 



A variety of expression vector/host systems may be utilized to contain 
5 and, express pd^eptidp. sequences, /These include, but are not limited to, 
microorganisms such as bacteria ^formed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 

"^^ct .peU^^^ems. ipifec^Bd _ with yip?s- ^l^?^?^ .^S^f 8 ^«c»^ ov ^« s); Pl«>* ce* 1 
" systems ^ transformed with virus expression vectors (e.g., cauliflower .mosaic virus, 
10 CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 

, ,.. pBI^22 plasmifls); or atumal peU svs|ems. ,. .. .y : 

The "control elements", or "regulatory sequences" present in an 
expression vector are those non-translated regions of me_vector--enhancers, promoters, 
5' and .3' untranslated regions-which interact with host cellular proteins to carry out 
15 transcriptional translation. Such etements^'m^y vary in their strength and specificity. 

" Depeiidmg'on ke vector system ^d^os^ utili^ ^nuniber of suitable transcription 
^ " . ^ promoters, may be used. 

For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
. lacZ. promoter, of the pE|LUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
20 : pSPORTl plasmid (Gibco BRL^ Gaithersburg, MP) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian 
, viruses , are generally preferred. If it is necessary to generate a cell line that contains 
multiple copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV 
may be advantageously used with an, appropriate selectable marker. , 
25 In bacterial systems, any of a number of expression vectors may be 

selected depending upon the use intended for the expressed polypeptide. For example, 
when large quantities are needed, for example for the induction of antibodies, vectors 
which. direct high. lcvkCTp^99 of Proteins that are readily purified may be 

used" Such vectore h^tade, bot"we not limited, to, the multifunctional K cpli cloning 
30 and expression vectors such as pBLUE^CRJPT (Strategene), in which the sequence 
encoding the polypeptide of interest may be ligated into the vector in frame with 
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sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; piN Vectors (Van Heeke, G. and S. 
M. Schuster (1989) J. Biol. Chem. "2^:550 1-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
5 proteins "with glutathione S-trarisferase (<3s¥j: in general, such fusion proteins are 

r , -r * * * m 

soluble and can easily be purified from ly&d cells by adsorption to glutathione-agarose 

beads followed by elution in the presence of free glutathione. Profeim made in such 

systems may be designed to include heparin, ferombin, or factor XA protease cleavage 

site s so that the cloned polypeptideof interest can %e released from the GST moiety at 
' 10 will. ' ' iC ' :/ ' • ' i ' ? : V<, i ■ ' 't. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha iacfor, alcohof oxidase, and PGH may 
be used. For reviews, 'see Ausubel et lall tWra) ari'd Grant et al. (1987) Methods 
U?i EnzjmdL 153:516^-544. X ! ' s ~" ! v ' c ' j - ^ ^ . . ,< 

13 ln cases where plant expression vectors are used, the expression of " 1 

' sequences encoding polypeptides may be driven by any of a number of promoters. For •' 
example, viral promoters such as the 35S and 19S promoters of CaMV' may be used 

'TM^ ttakamatsu, N/l 

(1987) EMBO J. 6':307-31 1. Alternatively, plant promoters ^such the Small subunit of ^ 

* t ■ . , ..,.■.„ 

20 RUBISCO or heat shock promoters may be used (Coruzzi, G. et (1S>84) EMBO J, 
5:1671-1680; Broglie, R. et al. (1984) 5«€nc« 22^:838-843 • and Winter, J. et al. (1991) 
Results Probl Cell Differ. 7 7: 85- 105). ' These constructs can be 1 introduced into plant 
cells by direct DNA fransformation or p^gen-me^a^'o^tfe^oni Such techniques 
are described in a number of generally available reviews (see^ for example, Hobbs, S. or 
25 Murry, L. E. in McGraw Hill Yearbook of Science arid Technology (1992) McGraw 
Hill, New York, N.Y.j pp. 191-196). ?s ' ! " ; : 

An insect system may also be used to eicpress 1 " a polypeptide of interest. 
For example, in one such system, Autographa calitorriida 'nuclear polyhedirosis virus 
(AcNPV) ! is used as a vector to express fdreigVgeries* m'Sp^d^pteia'feugip^da cells or 
30 in Trichoplusia larvae. The' sequences encoding the polypeptide ir^y' be clbned into a 
i- non-essential region of the virus; such as trie 'pblyhedrin gene, and placed lihder control 
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of the polyhedrin promoter.! Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example,; S, frugiperda 
cells or 'Trichoplusia larvae in which, the polypeptide of interest may be expressed 
5 , :(EngelhardyE.KretaK(1994)^ 

. ■• , . < , . . o • i . ;In mammaHan host cells, a' number of viral-based expression systems are 
/ generally available.: For example, in cases where an adenovirus is used as an expression 

■ • . vector; sequences encoding a polypeptide of interest may be ligated into an adenovirus 
;/..^tnmsci^oBAianslation:.compleK. <^.8tmgo«f;-J«K* late, promoter and tripartite leader 

10 sequence. Insertion in a non-essential El or E3 region of the viral genome, may be used 
■ •to obtain a viable virus which is capable of expressing tihe polypeptide in t infected host 
t cells (Logan, -J 4 and Shenk,- T. £1 984) Proc. Natl Acad Sci: Si .-3655-3659). Jn addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
: to increase expression in mammalian host cells. , , 
15 ! t i9v ' Specific initiation .signals may .also be ,u$ed y to aclneve more efficient 

or- S translation, bfcsequences; encoding ' A.poi^m^ mteresk Such signals include the 
Am initiation «odon and adja<^.sequences4vto..e«^^^ •isequences.encoding the 
^lypepfide^ its ":inifiation. cbdoh< and -upstream isequences are > inserted into the 
-± appropriaie expression vector, no (additional transcriptional or translation^ control 
20 a'signals may be needed. However, ii* cases where .only coding sequence, or a portion 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should ber provided. Furthermore, the' initiation codon should be in the correct 
reading frame to ensure translation of the entire /insert. Exogenous translational 
elements and initiation codons may be of various origins, ,both natural and synthetic. 
25 The efficieticy of expression maybe senhanced by the inclusion of enhancers which are 

■ ' apprdpriate for the particular cell system which is used, such as those described in the 
,!•<„> :Uteratuie-<&hart-D: et Bl.Xl994)Results-Probb Gell Differ. 20:125-162). 

. ; »;>;. hi addition, a host cell 'strain may be chosen.for its ability to modulate 

the expression: of the inserted 'sequences or, to process the. expressed protein in the 
30 - -desired fashion. Such modifications of the polypeptide include, ; but are; not limited to, 

■ acetylation, carbdxylationi glycosylation, phosphorylation, .lipidation, ! and acylation. 
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Post-translaubhal processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 
€HGV COS/-HeLa, MDCK, HEK293j and WI38, < which have specific cellular 
; machinery and characteristic mechanisms for. such postradiational activities, may be 

5 chosen to ensure the correct modification and'processing of the foreign protein. 
' - - 1 For long-term, high-yield* production of recombinant proteins, stable 

expression is generally preferred. For example, cell lirfes. which stably, express a 
' 1 -polynucleotide of interest may f be tfansformed using expression; vectors, which may 
; 'Contain viral ^gins^f^plfcattW atfd/or.«ndogenous5 expres»o|hietements and a 

v 

o . 510': • selectable marker gene on the same or bn/a<separaie vector. Following the introduction 
• r " * ; of the vector, 'cejls : rnar bellowed to grow for l-2 days in an/enriched media before 
Ui : tliey are Switched to selective. media.-'The'ptttpo'se.M the selectable -markep.-is to confer 
/ v ' - 11 1 resistance to selection, arid its presence allows growth and' recovery of cells which 
successfully ^express the introduced- sequences. Resistant clones of stablyctransformed 
- ; 45 cells may be proliferated using tissue culture techniques appropriate to the cell type. 

" ' Any^mber ^of ;seiebtipn systems may be used to recover .transformed 

'' ,u - ir!x ' y ^ell lines. TheSe"^mclude,fi5ut^e! »qtoKihited , tcs the herpes implex Virus, thymidine 
r ' kinase (Wigler, M. et al. (W?T> Cell Jl :223-32) and adenine, phc^phoiibosy^ltransferase 

- (Lowy, I. et al. (1990) Cell 22:817-23) genes which <^an be employed in..tk.sup.- or 
V1 20 ! aprt.sup.- cells, respectively/i Also, antimetabolite, antibiotic or herbicide, resistance can 
1 f- -be used as the basis for selection; for - example,..; dhfr 'which confers resistance to 
methotrexate (Wigler, M;. et al. (1980) Proc. Nath Acad Sot 770567*70); npt, which 
confers resistance to the aminoglycosides;, neomycinand G-41 8 (Golbere-Garapin, F. et 
"al (1981) J. Mol. Biol J 50:1-14); and als ou pat^ -which confer resistance to 
25 chlorsuLfuron and phosphinotriciri acetyltransferasev/ respectively . (Murry, supra). 
Additional selectable genes have been described, ifov examplei trpB* which allows cells 
to utilize indole in place of tryptophan, or hisl>, which, allows .cells to, utilize histinol in 
place of histidine (Hartrnan,: S.cC; and R. €i : . Mulligan £1988) Proc. Natl. Acad. Sci. 
85:8047^-5 1). The use. of visible markers bargained popularity, wjtb such: [markers as 
• s ' 30 ; anthocyanins,-beta-glucuronidaseiarid its substrate GUS, and. luciferase and=its substrate 
! ' ' ; luc ifei-in, being widely used not only to identify transfprmants, but also to quantify the 
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Amount of transient or stable protein expression attributable to a specific vector system 
(Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131). ' 

• Although the presence/absence ; of marker gene expression suggests that 
the eerie of" interest is also present, its presence and expression may need to be 

I , ■». . , ■ K * > i I * ^ 

5 confirmed. ' For example, if the sequence encoding a polypeptide is inserted within a 
' marker gene sequence; recombinant cells containing sequences can be identified by the 
abserice W marker gene^function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-ehcodmg sequence the control of a single promoter. 

Expression of the marker gene in response & induction or selection usually indicates 

10 expression of me tandem gene as well. 

Alternatively, host cells that contain and express a desired 
polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art These procedures include, but are not limited to, DNA-ONA or DNA- 

f ; -l&A'hybridyitions arid protein bioassay or 'immunoassay -techniques 'which include, 

15 -for example, membrane, solution, or chip based technologies for the detection and/or 



quWificatibn of nucleic acid ot protein. 



A variety of protocols for detecting 4 an^ of 
! ;i polynucieotide-ericodea products/ using ' either polyclonal or monoclonal' antibodies 
Ipecific for the product are known in the art." Examples include enzyme-linked 
20 immunosorbent assay (ELISA), rkdioinununoassay (RIA), and "fluorescence activated 
cell sorting (FACS). A two-site, monoclorial-based ^immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 
preferred for some applications, but a competitive binding assay may' also be employed. 
These and other assays are described, arribrig other places, in Hampton, R. et al. (1990; 
25 Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E. et al. ftW3;£ t kq>. Med 755:121 1-1216). 
: i; ' ' ? A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
' for producing labeled' hybridization or PCR probes for' detecting sequences related to 
30 polynucleotides^^ mck translation, end-labeling or PCR 

amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
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thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the } art, are, commercially available, and may be used to synthesize RNA 
^ probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
5 commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromoeenic agents 
as well as substrates, cefaclors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
. cultured under conditions smteble for the expression and recovery of the pptein from 
10 cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellular^ depending on the sequence and/or the vector used. As will be understood 
by those of skill .in the art, expression vectors containing polynucleotides of the 

• - . . > . > t * .j - i j ; ... 1 / ( s. ^ i ; v.; 

f invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constrictions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins, ^uch purification facilitating domains include, but are^ 
not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
20 immobilized immunoglobulin, and the. domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
25 protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 
described in Porath, J. et al. (1992, Prot. Exp. Purif. 5:2(53-281) while the. enterokinase 
cleavage site provides a means for .purifying the desired polypeptide from the fusion 
30 prot ( ein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 
et al. (1993; DNA Cell Biol. 72:441-453). 
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.•- > In r addition to recombinant production methods, polypeptides of the 
invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merriiield J: (1963) J. Am. Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
5 synthesis may be 'achieved,. fbr,i example, using Applied Biosystems ,431 A Peptide 
Synthesiser = : (Perkin ; Elmer). Alternatively, ^various fragments may b<; chemically 
; synthesized separately and combined using; chemical methods to produce the full length 
v ; molecule; L •.•.»"!•(• •»? ! ?••>< • ' •• •■ ■-•■< " : • 

■1 1 * \ ^ . ' ; t' ' '':";''? . , - i 

Antibody Compositions, Fragments Thereof and Other Binding Agents 

10 i n , r*J According, do - ahother^aspept,^ the tpmsott intention further provides 
^ binding agents, such; as aptibodies and antigerirbinding fragments thereof, that exhibit 
, • i immunological binding to a- tumor polypeptide disclosed herein, or to a portion, variant 
or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," ^mmunogicalfy^bind/' and/orltis "immunologically reactive?' to a 
15 polypeptide of the invention if it reacts at a detectable level (within,^ for .example, an 

* BMSA^assay 1 ) with the polypeptide*" and rdoes^not ^yeact detectably with unrelated 
^-.'-^i^eptidfes under siiriila^cdnditioii[sj-- : ■. ' ■ - r :r 

v; ; \ : Immunological binding, as used in this context, generally refers to the 

' non-covalent interactiohs of the type.^ which occur between an immunoglobulin 
20 molecule and an antigen for which the immunoglobulin is specific. The strength, or 

* affinity of immunological binding interactions can be expressed in terms of the 
dissociation constant (Ka) of the interaction, wherein a smaller Kd represents a greater 
affinity: Immunological binding properties of selected polypeptides can be quantified 

- - using methods well knoAvn in the art. One such method entails measuring the rates of 

25 ? antigen-binding site/aritigeh ^ompfex formation and dissociation, wherein those rates 
'•<- depend on the concentrations of the complex partners, the , affinity of the interaction, 
arid on geometric parameters that equally influence the rate in both directions. Thus, 
>'" y both the ^ ,? 6ri ! rate constant 11 (K^n) arid^the "off rate constant" (K 0 ff) can be determined by 

- calculation 'of thfe concentrations and the actual rates of association and dissociation. 

30 The ratio of Koff /Kon enables cancellation of all parameters not related to affinity, and is 
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thus equal to the dissociation constant Kd.See, generally, Davies et al. (1990) Annual 

Rey. Biochenu 59:439-473. i i» 

!* An "antigen-binding site," or "binding portion" of an antibody refers to 

the part • of the immunoglobulin molecule lhat participatesj in antigen binding. The 
5 1 antigen binding 5 site is formed by amino acid Residues <6f -the Nrterminal variable ("V") 
regions of the heavy ("H") and lighShr(.! l L?). chains. Three highly divergent stretches 
within the V regions of the heavy and light chains ai» -refejcred to as, "hypervariable 
regions" which are interposed between more conserved flanking stretches, known as 
"framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
JO which are^ naturally found . between and adjacent* to^ ihypervariablq regions in 
immunoglobulins. In an antibody molecule, the threejhypery^riable regions of a light 
4 * ch&in and the thrpe hypervariable regions of a hdavy chain are disppsed, relative to each 

< other inithree, dimensional space to form «n antigen-binding Surface; ; The antigen- 

^ binding surfaoesis complementary to the three-dimensional surface of a bound antigen, ;\ 
Oi \ 5r and the three hypervariabte ^ ' ; 

>... f n m , Binding i agents 'mky :i he tfbrthero :pap^ble; : of differentiating between % ;^ 

patients with and without a cancer, such as colon cancer, using the representative assays 
^provided herein. , For example, antibodies or other binding agents that bind to a tumor 
•*. 20 protein wilt preferably generate a signal indicating the presence of a cancer in at least 
'i o about 20% of patients with the disease, more preferably £t le^st about 30% of patients. 
Alternatively, or in addition^ the; antibody will generate, a, negative signal indicating the 
v absence of the* disease in at least about 90% of individuals without; the cancer. To 
; ; determine whether a binding agent satisfies this, requirement, biological samples (e.g., 
25 blood, sera, sputum, urine and/oB tumor biopsies) frojrnj patients with and without a 
cancer (as determined using standard clinical teste) may be^ assayed $s described herein 
for the presence -.-"'of; polypeptides ;that bind to ^e binding; agent Preferably, a 
• statistically significant number of samples with ,and withqut fl?e disease will be assayed, 
v Each binding agent should safe 
: . 30 fthe art will > recognize that binding agents may ibe used in; confirmation to improve 

• : sensitivity., ,a :o - . ?K V; ^ ,.■':•■■.•:-_*_ .,/\ . . ,'o • 
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Any agent that satisfies the above requirements may be a binding agent. 
For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule of a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any 
of a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow 
and Lane, Antibodies: A I^b^rdiory^M 1988. In 

general, antibodies can be produced by cell cultufe techniques, including the generation 
of monoclonal antibodies as described herein, or via tfansfection of antibody genes into 
suitable bacterial or mammalian cell hosts,' in of der to allciw for' the production of 
10 fecombiharit antibodies. In one technique, an immunogen'coriiprising the polypeptide 
' is initially injected into briy ot a wide' variety of ittaiiifnals (e.g.*, mice; rhts, rabbits, 
sheep of goats). In this step, tne polypeptides of this invention may serve as the 
immunogen without modification. Alternatively, particularly for relatively short 



, ,4 



polypeptides, a superior immune response iriay be elicited if the polypeptide is joined to 



15 a carrier protein, such as bovine sferii&i albuiniii J of keynble limpet hemocyanin'. The 
immunogen fs^ injected into the animal host^ prefef^i?ly according to a predetermined 
schedule incorporating one or more^bbbsief iihmuni^^ ahiiii^ls are bled 

- • : ' . ' t * ? . .. * .i. * ( r 

periodically. Polyclonal antibodies specific for the polypeptide may then be purified 
fioiri such antisefa by, for examjple, a!ffihity chromatography using the polypeptide 

20 coupled to a suitable solid support. 

Mbribclonal antibodies specific for an abtigehic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 
Immunol 6:5 11-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 

25 ' desired specificity (z.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal irrtmtinized as 
described above. The spleen cells are then immortalized by, u for example, fusion with a 
myeloma' cell' fusion ''pailn brie that is syngeneic with the immunized 

iahimai. A vakety of fusion techniiques may be employed. For exatople, the spleen cells 

30 and myeloma cells may be combined "with a noriiohic detergent for a few iftinufes and 
then plated at low density oh a selective medium that supports the growth of hybrid 
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cells, but not myelpma pells. A preferred selection technique uses HAT (hypoxanthine, 
t aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks 
polonies of hybrids are observed. Single colonies are selected and their culture 
supernatants testes^ for binding actiyity against the pplyp^tide. Hybridomas having 
high reactivity and specificity ?^ej)jreferred, ^ 

A . Mpnopjonal antibodies jrnay be isplate^i from thp supernatants of growing 
hybridoma polonies. In addition v WV?P S f ^ e ^M riic l u ^ noiay be employed to enhance the 
yield, r such as injeetion, pf.me hybrjdoma. c r e]U l jne into the peritoneal cavity of a suitable 
t " v^ftpb^ate host, $uch as a mouse. Monoclonal antibodies may then be harvested from 
>, ;*P r x* e ^ ite9 ,<9^ ?F: blopd. ^Contanjiinants may; be removed from ^antibodies by 

P^SfFp 0 ^ ^^P^HF^k^v c ^ omj ? 0 Sr a Pty>, g^ ^^ ti t °%. -Precipitation, and 
extraction. ^The polypeptides of this invention rnay be used in the purification process 

in, for example,, an affinity chromatography, step. 
; . * , A ^nuiptl?er of therapeutically : .useful molecules are known in the art which 

: < ^! ; , 5 om P rise ,antigen 7 bi»ding site? that , are capable of exhibiting immunological binding 

'.sr.Lrrrfj.ffT ^P er ^ ies g f -. ^^^^-ff^f? 1 ^,^ proteolytic enzyme papain preferentially 
: ; " , r^r ^.^eayesJgG ngo^e^.|p { _vi^ ^gy^lragments, two of which^ (tte ;^F(ab)" fragments) 
, .each copprise,a coyalent heterodimer that includes, an intact antigen^bindine site. The 
. , enzyme pepsin is.abje.to cleave Ig^Q. molecules to provide several fragment^ including 
20 the "FCab^ " fragment which comprises botii antigen-binding sites. An "Fv" fragment 
} , can be produced by preferential proteolytic pleavage of an IgM, and on rare occasions 
• IgG or IgA immunoglobulin, ir^olecule. Fv fragments are, however, more commonly 
derived using recombinant techniques known in the art. The Fv fragment includes a 
npn-covalent V H ::Y L .heterodimer including an antigen-binding site which retains much 
25 of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat. A,cad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:27.06-27 10;. and Ehrlich et al. (1980) Biochem 19:4091-4096 
. . ; , . A single chain Fy C'sFy;') polypeptide is a. covalently. linked V H "V L 

i; . .heterodimej: which is expressed from a gene fusion including V«- and V L -encoding 
30, genes linked by a peptide-encpding linker. Huston, et al. (1988) Proc. Nat. Acad. Sci. 
JJSA 85(16);5879-5883. A number of methods have. been described to discern chemical 
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structures for converting the naturally aggregated--but chemically separated-light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 
fold into a three dimensional structure substantially similar to .the structure of an 
antigen-binding site. See, e.g. , US, Pat. Nos. 5,091,513 and 5,1.32,405, to Huston et al.; 
5 and U.S. Pat. No. 4,946,778, to Ladner et al. . ...... 

Each of the aboye-described molecules includes a heavy phain and a 
, . light . chain CDR set, respectively interposed between a heavy chain and a light chain 
FR set which provide support to the CDRS and define, the spatial relationship of the 
CDRs relative to each, other. As used herein, the term "CDR set;' refers to the three 
10 hVDervariable regions of a heavy or light chain V, region. Prpceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "pDRl," ,, CDR2, ,, and 
"CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 
comprising the CDR set from each of a heavy and a light chain V region. A polypeptide 
comprising a single CDR, (e.g., a CDR1, CDR2 or CDR3) is referred to herein as a 
15 "molecular recognition unit" Crystallograpbic analysis- of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues, of CDRs form extensive 
contact with bound antigen, wherein, the mogt extensile antigen contact is with the 
heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an. antigen-binding site. n ; ., )0 

on As used herein, the term "FR set" refers to the, four flanking amino acid 

sequences which frame the CDRs of a CDR set of a heavy or light chain V region. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 
for folding the V region mto the antigen-binding site, particularly the FR residues 
directly adjacent to the CDRS. Within FRs, certain arnirio residues and certain structural 
25 , features are very highly conserved. In this regard, all V region sequences, contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a 
binding-site, the CDRs are displayed as projecting loop motifs .which form an antigen- 
bindine surface. It is generally, recognized that there are conserved structural regions of 
FRs which influence the folded shape of the CDR loops into certain "canonical" 
30 structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
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residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heavy and light chains. 

A number of "humanized" antibody molecules comprising an antigen- 
binding site derived from a non-human immunoglobulin have been described, including 
5 chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et al/U99lJ l^ature '349:293-299; Lobuglio et al. (1989) 
Proc. Nat. Acad. Scil USA 86:42204224; Shaw et ail (1987) J Immunol. 138:4534- 
4538; arid Brown et al. (1987) dancer lies, tf&ij -3 5S3), rodent blDRs'grafted into a 
human supporting Pk prior to fusion 'with an appropriate human' antibody constant 

10 domain (itiechmann et al. (i 9^8") Nature 332:323-327; : Verhoeyeri et al? i;i988) Science 
239^1534-153^6; ; and Jones ef' al: (19^6) V Nature Tr 32i r&S^ and ' rodent CDRs 
supported by reconibmariily veiieere# rodent tl^s" (European Patent Publication No. 
'51 9,596, published Dec. ' 23, 1*992). * these "humanized ,v molecules are designed to 
minimize' unwanted immunological response toward rodent antiriuman antibody 

15 molecules which limits the duration and effectiveness of therapeutic L appfications of 

those moieties in human recipients. J ■ • 

• '-i-;'" '■■''.) /^■ J , .:h i V i fi'i o::;^ ovinia j-~!t r^j-* - : \ - pp- • ■ - ^ 

As used herein, the terms "veneered FRs" and "recombinahtly veneered 

FRs refer to the selective replacement of FR residues from, e.g., a rodent heavy or light 

chain V region, with human FR residues in order to provicie a xenogeneic molecule 

20 comprising an antigen-binding site which retain^ substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 
the ligand binding characteristics of an antigen-binding site aire determined primarily by 
the structure and relative disposition of the heavy and light" chain CDR sets within the 
antigen-binding surface. Davies et ai: (1990) Ann. Rev. Biochem. 59:439-473. Thus, 

25 antigen binding specificity can be preserved 'in a htimanized antibody bnly wherein the 
Ct)R structures, their interaction with each other, and th^ir interaction with' the rest of 
the V region domains are carefully maintairied/By using vend^ring fecliniques, exterior 
(e.g., solvent-accessible) FR reisiclieis' wWch^are rea&iy enb6unfered by the immune 
system are selectively replaced with human residues to provide a : hybrid m6lecule that 

30 comprises either a weakly immunogenic, or substantially hon-immunogenic veneered 

' ' surface:"' ^ ' ' ' " -' .'- ' 
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' The process of veneering makes use of the available sequence data for 
human antibody variable domains compiled by Kabat et al., in .Sequences of Proteins of 
' Immunological Interest, 4th ed., (U.S. Dept. of Health and ; Human Services, U.S. 
Gbvemmerit Printing Office, 1987), updates to the Kabat database, and other accessible 
5 U.S. and foreign databases ; (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be' deduced from the known three-dimensional structure for 
^ Whan arid murine antibody fmgments; There are twd general steps in veneering a 
murine antiien^binding site! IhltiaUy, the FRs of ^variable domains of an antibody 
^-yS,i^ker^t ait coftipared vm^tt^&m^ sequences of human variable 
10 domains dbtalned from the abbve^dehtified^^ human V 

r ^gion^are then compared residue by residue to cdrresponding murine-amino acids. The 
residues in the murine FR which differ from' the human counterpart are replaced by the 
residues present in the Wan moiety using recombinant techniques well known in the 
'art: Residue switching is only carried but with moieties which are at least partially 
15 "exposed (solvent accessible); and carets exercised- f* the replacement , of amino acid 
' residues which may have a sigmficah^^fect Wte tertiary structure .of V region 
^""^mains; ^'^'pro^giycme^^^aifi^ adids>- <■<> 
r. -> "''J 1 ^ ;; - In this m 'anher, the f eWltaht " veneered" murine antigeri^bmdmg sites are 
thus designed to retain the murine CDR< residues, the residues substantially adjacent to 
^0 ' the CDRs, the residues identified as' buried or mostly buried (solvent inaccessible), the 
residues believe* to participate in nori-covalent {e.g., electrostatic and hydrophobic) 
; ' contacts between heavy and light chain domains, and the residues from conserved 
' structural regions of the FRs which are believed to influence the "canonical" tertiary 
structures of the CDR lo'ops- These design criteria are tiien used to prepare recombinant 
25 ; nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
? " f "murine antigen-binding site into humaniappearing FRs that can be used to transfect 
< rnammaiian cells for the expression of recombinant human antibodies which exhibit the 

: antigen specificity of ^ 

v? In another embodiment of. the invention,' monoclonal antibodies of the 

30 present invent may be coaled to one or more therapeutic agents. Suitable agents in 
'this regard include' radionuclides; differentiation inducers, drugs, toxins, and derivatives 



' WO^6iV§6^0 PCT/US01/18577 

72 

thereof. .Preferred radionuclides include 90 Y,. I23 I, 125 I, ,3, I, 186 Re, ,88 Re, 2U At, and 
212 Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbpl esters , and butyric acid. Preferred 
toxins' include ricin, abrin, diptheria toxin, r cholera toxin, rgelonin, Pseudomonas 
5' exotoxin, Shi geHa toxin,, and ppkeweed aptivi^al protein. ^ . 

A therapeutic agent ; naay be .coupled (e.gi, coyalently bonded) to a 
suitable monoclonal; antibody jejther , daftly ,or indirectly vi^a linker group). A 
> direct reaction between , on* ^agent and qji; antibody isf, possible when each possesses a 
:V substituent capable; of rp^^^^wth^e other. For example, a pucjepphilic group, such 
10 hs aniamhTb:oir ^ sulfh^dryi grpup^pn ope n^ay^be capable of .reacting with a carbonyl- 
' containing group, siach : as ; a#i anjhydridje or an ( aci4 f h^Hde, or y/ith an alkyl group 
containing a good leaving grpup r (e.g., a hali4e) iqn tjie 9ther. ( t . 

• ; Alternatively, it may.be desirable, to couples a therapeutic agent and an 
? antibody via, a linker group y- A linker group can. function as a spacer to distance , an 
45 T antibody from an agent, i# order to avoid interference with.jbindipg capabilities. .-..A 
' l/ linker group can. alsojsery^ ^p- ippr^as^ the chemical reactivity of a substituent on an 
agent or an antibody jap^ : An^ ixicrease ^in 

s chemical Reactivity maty also facilitate the use of agents, pr functional groups on agents, 
which otherwise would not be possible. . „ t 
20 i It will be evident -to those skilled in the art that a variety of bifunqtional 

or polyftinetional reagents, both homo r apd hetero-functional (pich as those described 
in the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected^ for example, through amino groups, carboxyl groups, 
sulfhydryl groups or oxidized carbohydrate resid\iejS Nt Tfyere are numerous references 
25 describing such methodology, e.g., U.S. Patent N,q. 4,671,958, to Rodwell et al._ 

Where a therapeutic agent is more pptent when free from the antibody 
portion of the imrnunocpnjugates pfth^ present inv^ntiop, it may be desirable to use a 
linker group which is cle£yable. ,duripg or upon internalization into a ceil. A number of 
different cleayable linker - groups have been described. The mechanisms for the 
30 ■'..! intracellular release of ^n agent from these linker groups include cleavage by reduction 
of a. disulfide l?ond {e.g., U.S. - Patent t No. 4,489,710, to Spitler), by irradiation of a 
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photolabile bond {e.g., U.S. Patent No. 4,625,014, to Senter et al.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Paterit No. 4,569^789, to Blattler et al.). 
5 i ■•! It may be desirable to couple more than one agent to an antibody. In one 

; ^embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, inore than one type of (agent may be coupled to one antibody. 
Regardless of the pai^<?ularj embodim^it, immtinocdiljugates with more than one agent 
:o may be prepared in ^ ivariety of waysl * For example, more than one agent may be 
10 coupled directly to;x ail antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used, 
g ; ii: ! u ?. v ; A carrier may bear the agents! in a variety of ways, including covalent 
bonding esither directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
15 -such as aminodextran (e.g.\ U.S: Patent Nov 4^699,784, to Shih et al.). A carrier may 
also bear an ^ agent by noncovdent ^bcmding or by r encapsulation^ such as within a 
: : liposome vesicle:(^ Caixieirs specific for 

r ^ agehtis include a radioMlogenated small molecules and chelating 

compounds. For > example, U.S. Patent No: 4,735;792 discloses representative 
20 radiohalogenated? small molecules arid their synthesis. A radionuclide chelate may be 
r ^ formed from chelating compounds that include those; containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S; Patent Noi 4,673,562, to Davison et b1. discloses representative chelating 
compounds and their synthesis. - 

25 T Cell Compositions > 

1 i: f . The present invention, in another aspect; provides T cells specific for a 

• tumor pblypeptide disclofeeS herein, or for a variant or derivative thereof. Such cells 
< may generally be prepared in vitro or ex vivo, using standard procedures. For example, 
T cells, may be ' isolated from bone marrOw 9 r;peripheral bloody or a fraction of bone 
30 marrow^ or peripheral blood of a patient, using a commercially available cell separation 
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. system, such asthe.Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, 
CA; see alsCyU.S.iPatent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/161 1 6 and WO 92/07243). Altemativelyi T cells may be derived'from related or 
{ ;p i unrelated Humans, non-human mammals, cell lines or cultures. < 

5 T cells may be stimulated witb a polypeptide, polynucleotide encoding a 

polypeptide and/or ani antigen presenting «ell. (APC> that { expresses such, k polypeptide. 
~? Such stimulation is performed 'unde^/conditioife and for a time sufficient to permit the 
generation of T cells; that are , specifio-for -mea polypeptide of interest. , =■ Preferably, a 
" • .t? tumor polypeptide, or* polynucleotide of the /invention ;as present within ;a delivery 
r :v 10: - vehicle, such as a microsphere; to facilitate the generation of specific T cells. 

* T cells are considered to >be specific for ja polypeptide; of the present 
^ ' invention if the T cells, specifically proliferate, secrete ^cytokines or kill target cells 
coated Jwith the polypeptide or. expressing ,a gene encoding the polypeptide. T cell 
specificity may be evaluated using "any; of a variety of standard ^techniques. For 
m^j IS: ^example, vvithih a .cnKahium release assay; or 7 proliferation assay, a stimulation index of 
mdli, ^ mors. than twcifokl inci^asean^sisjand/or^prol^ compared tojjaegative controls, 
. : o indicates T cell spfccificity^Sufefi^ described 
.; u in Chen et al., Cancer Re&..:54:ifm*l070, 1994k Alterjaatively^dete^tion of the 
i proliferation of Tcells may >be accomplished by a variety of known techniques. For 
20 example, T cell proliferation , can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of -/I, cells with, tritiated ^ thymidine and 
o measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
- ; tumor polypeptide (100 ng/ml * 100'ng/ml, preferably 200. rig/ml - 25 ug/ml) for 3 - 7 
days will typically result in at least a two fold increase in proliferation «?f the T cells. 
25 Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which avtwq fold increase in the level of 
cytokine release (e.g., TNF or lFN-y) is indicative of £ cell :activation (see Coligan et 
aU Current Protocols, m Immunology, vol. h Wiley,.Interse.ience i (Qreene-1998)). T 
-:•> 5 - cells that havq been, activated in response, to, a Jwmor .polypeptide; polynucleotide or 
: ...i 30 polypsitfde-expreiSS^^ 

cells may .be.. ; expanded using standard techniques. Within preferred embodiments, the T 
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cells are derived from a patient, a related donor or an unrelated donor, and are 

administered to the patient following stimulation and expansion. 

For therapeutic purposes, CD4+ or CD8+ T cells that proliferate in 
response to a tiimor polypeptide, polynucleotide or APC can ; be expanded in number 

5 either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or 
a short peptide corresponding to an immunogenic portion of such a polypeptide, with or 
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 
cells that synthesize a . tumor iw^gtide. . Al^rnatiyely, ^ one or more T cells that 

10 proliferate in the presence of the tumor polypeptide can be expanded in number by 
clonine. Methods for cloning cells are well knowp in the art, and include limiting 
dilution. 



T Cell Receptor Compositions ? • 

15 The T cell receptor (TCR) . consists of 2 different, highly variable 

polypeptide chains, termed the T-cell receptqr, a .and , p chains, that are linked by a 

r m ^ Mi^'xZR; i f 9 w 1 148 " 159 - 

. , Elsevier Science Ltd/Garlan4 Publishing. 1999). The, o/p heterodimer complexes with 
. the invariant CD3 chains ,at the cell membrane. This cpmplex recognizes specific 
20 r antigenic, :peptides bound to f MHC molecules. The enormous diversity of TCR 
s specificities is generated, much like immunoglobulin diversity, through somatic gene 
rearrangement, The p chain genes contain oyer 50 variable (V), 2 diversity^), over 10 
: joining (J) segments; and 2. constant region segments (C). The ot.chain genes contain 
over 70 V segments, and over 60 J segments but, no D segments, as well as one C 
25 segment. During T cell; development in titie thymus, the D to J gene rearrangement of 
the p chain occurs, followed by the V gene segment rearrangement to the DJ. This 
, ; functional VDJp exon is transcribed and spliced to join to a Cp. For the a chain, a V tt 
gene segment rearranges to a J„ gene segment to.create the. functional excn that is then 
- transcribed arid spliced to the C&. ^Diversity is further increased during the 
30 ' 1 fecombmatiori process by the- random addition of F and N-riucleotides between the V, 
D ; and J segments of the p chain and between -the V and J segments in the a chain 
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(Janeway, Travers, Walport. Immunobiology. Fourth Ed., 98 and 150. Elsevier Science 
Ltd/Garland Publishing. 1 999). ' 1 ;! 

The present invention, in anotnfer aspect^' provides ''TCRs specific for a 
colon tumor polypeptide disclosed herein, or for a variant or derivative thereof. In 
5 accordance with the present invention, polynucleotide and annno acid sequences are 
provided for the V-J or V-D-J junctional regions or parts thereof 3for the alpha and beta 
chains of the T-cell receptor which recojgnize tumor polypeptides described herein. In 
general, this aspect of the invention relates to T-cell receptors which recognize or bind 
tumor polypeptides presented in the context of MfrC.' In a preferred embodiment the 
10 tumor antigens recognized by the T-cell receptors comprise a poiypeptide ofthe present 
invention. For example," cDNA encoding a TCFL specrficior^a colon tumor peptide can 
be isolated from T cells specific for a tumor" polypeptide using standard molecular 
biological and recombinant DNA techniques. " r * 

This invention further includes the T-cell receptors or analogs thereof 
15 having substantially the same function or activity 'as %e T-cell receptors of this 
invention which recogmze or birid tumor polypeptides. Such receptors include, but are 
not limited to, a fragment of th£ receptor, or a substitution, addition or delation mutant 
of a T-cell receptor provided herem'* This c Invention also eWco'mpasstes ijbrypeptides or 
•p^ptides'liifit-'are' s^b^^tiall^^cmidRygdus"to uie T-cell receptors ^provided herein or 
20 that retain substantially the ^ame activity. The lerm "anafog" includes any protein or 
polypeptide having an amino acid residue sequence substantially idehtrcal to the T-cell 
receptors provided herein in which one' or moit residues, preferably no more than 5 
residues, more preferably '- no more- than 25 1 residues ; have been conservatively 
substituted wdth a functionally' similar residue and livhich displays the functional aspects 
25 of the T-fcell receptor as described herein. !1 ' ; v. - v > 

The presenf inventioii further proviaes for suitable mammalian host 
cells, for ' example, non-specifici"T cells,- that are transfected with , a j polynucleotide 
encoding TCRs specific.for a-poljfpeptide described, herein, theieby rendering the host 
cell specific for the polypeptide; ; The a and P chains of tiie-TGR may. be contained on 
30 separate expression vector? or » alternatively^ o^ a, sipgle exjg-essicpi ^ vector that also 
' containis an internal ribosome ^try site (IR^S 
• -.gene ..downstream, pf the IRES. . Said host, pejls. expressing; TCRs specific for the 
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polypeptide may be used, for example, for adoptive immunotherapy of colon cancer as 

discussed further below. . 

In further aspects of the present invention, cloned TCRs specific for a 
polypeptide recited herein may be used in a kit for the diagnosis of colon cancer. For 
> example, the nucleic acid sequence or portions thereof, of colon tumor-specific TCRs 
' can be used as probes or primers for the detection of expression of the rearranged genes 
encoding the specific TCR in a biological sample: therefore, the present invention 
' further provides for an assay for detecting' messenger RNA or DNA encoding the TCR 
specific for a polypeptide. 
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1 0 PHARMACEUTICAL COMPOSITIONS 

In additional embodiments, the present invention concerns formulation 
of one or more of the polynucJLeotide,' polypeptide, T-cell, TCR, and/or antibody 
compositions disclosed herein in pharmaceutically-acceptable carriers for 
administration to a cell or an animal, either alone, or in combination with one or more 

. ■ < ■ - •• .1.-- , «• - ,■ ' ■"■ " H.,^ . -c ' '"., r > XV:- : \ : I; , ./ ..; ij- • U-'.; ' •. : -. { >> 

*• ■' •'■ - " •. j. ■ " - * " ■ 

1 5 other modalities of therapy. _ 

It will be understood that, if desired, a composition as disclosed herein 

may be administered in combination with other agents as well, such as, e.g., other 

proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 

virtually no limit to other components that may also be included, given that the 

20 additional agents do not cause a significant adverse effect upon contact with the target 

cells or host tissues. The compositions may thus be delivered along with various other 

agents as required in the particular instance. Such compositions may be purified from 

host cells or other biological sources, or alternatively may be chemically synthesized as 

described herein. Likewise, such compositions may further comprise substituted or 

.... . i ^ { - - 

25 derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
comnositions are provided comprising one or more of the polynucleotide, polypeptide, 
antibody, TCR,. and/or T-cell compositions described herein in combination with a 
physiologically , .acceptable carrier. In certain, preferred embodiments, the 

30 pharmaceutical compositions of the invention comprise irrimunpgenic polynucleotide 
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and/or polypeptide compositions of the invention for use in prophylactic and theraputic 
vaccine applications. Vaccine preparation is generally described in, for example, M.F. 
Powell and MJ. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)/ 1 

Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 

s v . ' r . }- * ■ r i . 

5 polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 

: It will be apparent that any of the pharmaceutical compositions described 
herein can contain pharmaceutical^ acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 
10 pharmaceutical^ acceptable non-toxic bases, including organic bases (e.g., salts of 



* : i 



primary, secondary and tertiary amines and basic amino acids) arid inorganic bases 



'TO , -r 



f 



1 '_»(. : * 



(e.g., sodium, potassium, lithium, ammonium, calcium and magnesium salts) 



In another embodiment, illustrative immunogenic compositions, e.g., 



■ 1 



15 



vaccine compositions, of the present invention comprise DNA encoding one or more of 
the polypeptides as described above, such that the polypeptide is generated in situ. As 



r 



noted above, the polynucleotide may be administered within any of a variety of delivery 
systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by Rolland,' Crit Rev. 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 



20 



Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 



25 



promoter and terminating signal). Alternatively, bacterial delivery systems rnay involve 
the administration of a bacterium (such as Bacillus-Calmette-Giterriri) that expresses an 
immunogenic portion of the polypeptide on its cell surface of secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immunogenic polypeptides described herein are introduced into suitable mammalian 

. ■ r ■ . ' 

host cells for expression using any of a number of known viral-based systems. In one 



illustrative embodiment, retroviruses provide a convenient "arid Effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 
present invention can be inserted into a vector and packaged ur retroviral particles using 

' *' *.,*.'••■"*. f * • r ' ■ ■ r , . ' ' ' t / ' ' { * *' ' ' ' "f . i ■ ' *' ' • * J 

techniques known in the art. The recombinant virus can 'then be isolated and delivered 



30 
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to a subject. A number of illustrative retroviral systems have been described (e.g., U.S. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Burns 
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
5 (1993) Curi Opin. Genet. DevelopV 3 : 1 02-109. 

In addition, a numfeer of illustrative adenovirus-based systems have also 
been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis ^^-Ahmacf and GraHam (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
10 Virol. #:59li -5921 /liitteredbr et al. (1994) Human Gene Therapy 5:717-729; Seth et 
^ ai; (1994) J. ViroL 6^:933-940; Barr et al'. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
' ' "'(I !98^ioTechniques 6:616-629; and Rich et al. (1993) tiuman Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 

15 developed for polynucleotide delivery. AAV vectors can be readily constructed using 
techniques weli known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
International' Publication Nosl WO 92/6 1070 and WO 93/03769; Lebkowski et al. 

' ' (1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 

20 539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97-129; 
Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Therapy 1 :165-1 69; and Zhou et ai. (1994) t Exp. Med. 179: 1 867-1 875. 

Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 

25 from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 

30 then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous, recombination serves to insert the vaccinia promoter plus the gene 
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encoding the . polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by cultupng the cells in the presence of 5- 
bromodeoxyuricjine and picking viral plaques resistant thereto. 

r _ A vaccinia-based infection/transfection system can be conveniently used 
to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism. In this particular system 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RJ4A pplymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 
10 transfected with the polynucleotide or polynucleotides of interest, driven by a T7 
promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 



recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 



1 1 



transient, cytoplasmic production of large quantities of RNA and its translation 
15 products. See, e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et aL Proc. NatL Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 
20 protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Metho<ds for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 
above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos 5,8^3,723; 6,015,686; 6,6o8 3 035 and 6^015,694. Certain vectors based 



25 



30 



1 1 



on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 



BNSDOCfD: <WO_01 96390 A2_l > 



WO 01/96390 PCT/US01/18577 

81 



Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 



under the invention. 

Additional illustrative information on these and other known viral-based 



delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl. Acad Set 
USA 86311-321, 1989; Flexner et al., Ann. N.Y. Acad Sci. 560:86-103, 1989; Flexner 
et al., Vaccine 5:17-21, 1990; U.S. Patent Nos. .4,603, U 2, 4,769,330, and 5,017,487; 
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; 

10 Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al., Science ^52:431-434, 1991; 
Koils et al., Proc. Natl. Acad. Sci. USA 91 :2 15-219, 1994; Kass-Eisler et al., Proc. Natl. 
^cad Sci. P0:1 1498-1 1502, 1993; Guzman et al., Circulation 55:2838-2848, 1993; 
and Guzman et al., Cir. Res. 73:1202-1207, 1993. 5 

i In certain embodiments, a polynucleotide may be integrated into the 

1 5 eenome of a target cell. This integration may be in the specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet fturther embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
DNA., Such polynucleotide segments or "episomes 11 encode sequences sufficient to 

20 permit maintenance, and replication independent of or in syiicjironization with the host 
cell cycle. .The manner in which the expression construct is delivered to a cell and 
where in the cell the polynucleotide remains is dependent on the type of expression 

construct employed. , , r ... . 

In another embodiment of the invention, a polynucleotide is 
25 administered/delivered as "naked" DNA, for example as described in Ulmer et aL, 
Science 25^:1745-1749, 1993 and reviewed by Cohen, Science 259: 169 1-1 692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, whicl^ are efficiently .transported into th^ f 

In still another embodiment, a composition of the present invention can 
30 be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 
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devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in 
U.S. Patent Nos; 5,846,796; 6,010,478; 5,865,796; 5,584^07^(1 EP Patent No. 0500 

r> ' j" * 1 - ' 1 . J' ■ . , r , i , 

799. This approach offers a needle-free delivery approach wherein a dry powder 
5 formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest 

In a related embodiment, other devices and methods that may be useful 
for gas-driven needle-less injection of compositions of the present invention include 
10 those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
' M in U:S. Patent Nos. ^^B;S24; 5^64^13; l£3l2,335; 5,:&3^1; 5^9$$63; 5,5^0,639 
and 5,993,412. 

According to Mother embodiment, the pHarmaceutical compositions 
descnbed herein will comprise one or more immunostimularits in addition to the 
imrriunogenic polynucleotide, polypeptide, antibociy, T-cell, TCR, and/or APC 
compositions of this invention. Xii immxmostimulant refers to essentially any substance 
that enhances or potentiates an immune response (antibody and/or cell-mediated) to an 
exogenous antigen. One preferred type or immunostimulant comprises an adjuvant. 
Many adjuvants contain a substance designee! to protect the antigen from rapid 
20 catabolism, such as aluminum^ hydroxide or mineral oil, and a stimulator of immune 



responses, such as lipid A, Eortadella pertussis or Mycobacterium tuberculosis derived 
proteins. Certain adjuvants are commercially available as,' for example, Freund's 
Incomplete Adjuvant an<j Complete Adjuvant (E)ifco Laboratories, Detroit, MI); Merck 
Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (Smitliklihe Beecham, 



it 



25 Philadelphia, PA); aluminum salts such as aluminum hydroxide; gel (alum) or aluminum 
phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; 
acylated sugars; cationically or anioriically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspneres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, intefleiikiri-% -7; -12, and other like ^owth factors, may 
30 also be used as adjuvants. 
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Within certain embodiments of the invention^ tiie> adjuvant composition 
is preferably one that induces an immune response predominantly of the Thl type, 
Highlevels of Thl-type cytokines (e.g., IFN-y, TNFa, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
5 high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 
■ induction of humoral immund responses. Following application of .a vaccine as 
, prb\dded herein, a patient r\yill support an, immune response that includes Thl - and Th2- 
type responses. < Within a preferred embodiment* in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 
10 > /of Th2-type cytoMaes. The levels of these cytokines may be readily assessed lusing 
* - standard assays. Fox a review of the famSliesxqf :Cytokines,.se^ Mosmann and Coffman, 
- ^Ajtri^ Rev. Immunol: 7:H5-173 f 1988. ■^ ^^I 1 ^ ( C , ^r. o . ; : 

;^ ,L Certain preferred adjuvants \. for eliciting predominantly Thl-type 

i response include, for example, a combination of monophfosphoryl lipid A, preferably 3- 
IS de-O-acylated monophosphoryl lipid A* ? tpgetherr withrr an ^ aliuninum salt. :MPL® 
adjuvants are available from rGorixa Corporation j(Seattle§ WA; s&e, for" example, US 
Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094), CpG-containing 
^ oligonucleotides (in which tthe €pG dinucleotide is unmethylated) also induce a 
• - predominantly Thl response. Such oligonucleotides are well known and are described, 
20 for example, in W0 96/02555y WO) 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Immunostimulatory *DNA sequences iare also described, for example, by 
- ? Satb et stL, Science 275:352, 1996. Another preferred adjuvant comprises a saponin, 
- such as Quit A, -tfr derivatives ^thereof, including QS21< and QS7 (Aquila 
; > -Biopharmaceuticals Inci, Framihgham, MA); Escin; Digitonin; or Gypsophila or 
25 < 'Chenopodium- quinoa sapdnins ^ Other preferred formulations include more than one 
^ saporiinHti ^the y adjuvant' 1 combinations of the present invention, for example 
combinations of at least two of the; following group comprising QS21, QS7, Quil Aj P- 
•■ ' escin^ ordigitbnin. ■ 1 : -v». ■ -:'<.-> ';■ 

; 1 •>< ■ . Alternatively ithe' s^^ 

30 vehicles corhpb'sed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix, 
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particles ^composed ^ of polysaccharides < or chemically modified polysaccharides, 
= liposomes and iipid-based particles; particles composed of gly^erol monoesters, etc. 
The saponins may also be formulated in ihe presence of cholesterol to form particulate 
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
i 5 together/with a polyoxyethyfene ether or ester, in either a inoh-particulate solution or 
suspension, or -in a particulate structure ^such, as a* paucilamelar liposome or ISCOM. 
• .. ': The saponins* may also be formulated :~whh <excipients; such as Carbopol^ ito increase 
; 1 viscosity, fcr may be formulated dn a dry powdfer form wit& a powder excipient such as 
-i vj /lactose. »-v :-; n oi ;;.v- --'tr?;:oJ ,. ! * r . L ^-\rT}o ,v . J; iv^.-.i" 

10 ^; : vv_ : In, one v preferred 5 embodiment, ; the adjuvant system; includes the 
combination > of a as the 

combination of QS21 and 3D-MPL® adjuvant, as described in WO .94/00153, or a less 

i. 

a ; reactogenib composition; where the :QS2i isiquenched with cholesterol, as described-dn 
' WO 96/33739. Other preferred fGrm#ations : comprise an oiUn-water emulsion and 
r '\it5 tocopherol. Anotheri particulariy preferred adjuvant foijmulation employing QS21, 3Dr 
?aj -iQ " MPh® adjuyaM and ^tecjopheiirotn m> Tan :6iHn-water ( emulsion is /described in . WO 
:4nir-'i; ; ..\T:-95/17210. * "0. ^ WO^dX** :lld -^v ^tc'l # 

■ . i t Another erihanced /adjuvant systenk involves the combination of a. CpG- 
containing oli^6hucleotide and a saponin derivative particularly the combination of 
20 iCpG and QS21 is disclosed ill WO 00/09159. Preferably the formulation^ additionally 
jt comprises Ian oil in water emulsion and tocopherol. r 

; Additional illustrative adjuvants r for use in the jAairnaceutical 
.< compositions of the invention include Montanide ISA 72©> (Seppic, France), SAF 
(Chiron, California, United States), ISGOMS(CSL), MF-59 (Chiron), the SB AS series 
;= 25* of adjuvants (e.g.,:SBAS-2 or SBAS-4, avfcilahle from SmithKline Beecham,>Rixensart, 
Belgium), Detox (Enhanzyn®) (Corixa^ HamUton-MT)* RC-529: (CQrixa, Hamilton, 
u MT) and .other ajninoalkyl glucosaminide 4-,phosphates (AGPs)^ such as those 
described in pending U.S. Patent Application Serial Nos. . 08/853, 826 and 09/074,720, 
; *v ; the disclosures. of which are: incorporated:herein .by reference in their entireties, and 
. 30 : piolyoxyethylene ether adjuvants such as those described ift- WO 99/52549A1 

^ i t i ' i . i - -* i ■■ . ■ : ♦ ^ * 1 . i ♦ { f - - ( ■ j ■ * ** : p » ■ ( * ' ■ * . 
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Other preferred adjuvants include adjuvant molecules of the general 
formula " - 

(I): HO(CH 2 CH 2 6)„-A-R, ' • ' 

wherein, n is 1-50, A is a bond or -C(O)-, R is Ci. 50 alkyl or Phenyl Ci- 50 alkyl. 
5 One embodiment of the present invention consists of a vaccine 

formulation comprising a ^lyoxye&ylerie ether of genera! formula (I), wherein n is 
between 1 arid 50, preferably 4-24, most preferably 9; the R component is C1.50, 
preferably ' C4-C20 alkyl and most preferably ' C 12 alkyl, and .^f is a bond. The 
Concentration of the polyoxyethylene ethers should be in the range 1-20%^ preferably 

lO* from 6j-ib%, and inost preferably in the range <X1-1%: Preferred polyoxyethylene 
"ethers are selected from the following group: poiy^^ ether, 
polyoxy^^ ether, ^olyoxyethylene-4- 

lauryl ether, polyoxyetHylene-35-lauryl ether, arid polyoxyethylene-23-lauryl ether. 
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described m the Merck 

15 index (12 th edition: entry 7717). These adjuvant molecules are described iA WO 

The polyoxyethylene ether according to the general formula (I) above 
may, if desired, be combined wifii another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending VKl patent application 
20 GB 9820956.2. " ; ' ' ' v ! 

According to another embodiment of this invention, ah immunogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages, B cells," monocytes and other cells that may be 

engineered to be efficient APCs. Such cells may, but need not, be genetically modified 

■ -. - ■ .i-i . • ^ 

25 to increase the capacity for presenting the antigen, to improve activation and/or 
maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver (/.^./matched HLA haplotype). APCs 
may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologbus, aliogeheic, syngeneic 

30 or xenogeneic cells. 
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Certain preferred embodiments of the present invention use dendritic 
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 3P2:2 ; 45-251, 1998) and have been shown to 
be effective as a physiological., adjuvant for eliciting prophylactic or therapeutic 

5 antitumor immunity (see Timmerm^n and Levy, Ann. Rev. Med 50:507-529, 1999). In 
general, dendritic cells may be identified based on their typical shape (stellate /#a 
with marked cytoplasmic processes (dendrites), visible in vitro), their ability to take up, 
process and present antigens ^th , high efficiency and their, ability to activate naive T 
cell responses. Dendritip cells, may, of course, be engineered to express specific cell- 

10 surface receptors or Ugands that are not cpmmonly found on dendritip cells in viy& or ex 
. v/vo» and. such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be |ised ( within $ vaccine (see Zityqgel et al., Nature Med 4:594-600, 



15 Dendritic cells ^tid progenitor? may be obtained fj^pm peripheral blood, 

bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 
f nodes, sp>leen, skin ? ijpbil^ other suitable tissue or fluid. For 

example, dendritic cells may be differentiated ex vivo by adding a combination of 
cytokines such as GM-CSF, , IL-4, IL-13 and/or TNFoc to cultures , of monocytes 

20 harvested from peripheral blood. Alternatively, CD34 positive : cells harvested from 
peripheral blood, umbilical cord blood or bone marrow ^ may be differentiated into 
, dendritic cells by adding to the culture medium combinatipns of GM-CSF, IL-3, TNFa, 
CD40 ligand, LPS, flt3 ligand ( and/or other compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. . 

25 Dendritic cells are conveniently categorized as "immature" and "mature" 

cells which allows a simple way to discriminate between two well , characterized 
phenotypes. . However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 

30 correlates with the high expression of Fey receptor and ma.nnose receptor. The mature 
phenotype is typically characterized by a lower expression of these markers, but a high 
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expression of cell surface molecules responsible for T cell activation such a$ class I and 
class II MHC, adhesion molecules (e.g., CD54 and GDI 1) and eostimulatory molecules 
(e.g., CD4b,CD8G,CD86and ^-lBB)i f : n 

APCs may generally be transfected with a polynucleotide of the 
5 invention (or portion or other Variant thereof) such that the encoded polypeptide, or an 
immunogenic portion thereof, is Expressed on thecellsurfatce. Such transfection may 

1 take place ex vivo, and a pharmaceutical composition comprising such transfected cells 
*> . 
may then be used for therapeutic ] pliiposes; as described herein; Alternatively, a gene 

(*,""* 

'delivery vehicle that ^argets^U dendritic cr other antigen^ presenting cell may be 
10 administered to a patient, resulting in transfection that occiirs in vivo. In vivo and ex 
- - vfv^ example, may generally be performed using any 

1 0 ' " methods kho^ in the art, such as those described in WO 97/24447, or the gene gun 
approach described by Mahvi et3 aL, Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
15 progenitor cells with tlie tunior polypeptide, DNA (naked or within £ plasmid vector) or 
1 t ^NA; to with ^ ( e \g f vaccinia, 

fdwlpox, aJdenovirds 1 or : iCTtiVh^ ^ebtd»). l! Prior to loading the polypeptide may be 
f ' cdvaleritly -conjugated fo an immunological partner that provides T cell help (e.g., a 
■ camet molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
20 immunological partner, separately or in the presence of the polypeptide. r 

While any suitable carrier known to those of ordinary skill in the art may 
be employed in 1 the pharmaceutical compositions of this invention, the type of carrier 
will typically' vary depending on the mode of administfation; Compositions of the 
present invention may be formulated for any appropriate niknner of administration, 
25 including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal, subcutaneous and intramuscular administration. 

^ • < Ca[rriers for use withiii Such pharmaceutical compositions are 
biocompatible,- ^ also be Biodegradable. r In certain embodiments, the 

formulation preferably provides a relatively l constsiht level of active component release. 
30 In other embodiments, however, r a more rapid fate of release' immediately upon 
administration may be desired. The formulation of such compositions is well within the 
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level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard . include jnicroparticles of polyCl^ctide-co-glycolide^i polyacrylate, latex, 
starch, cellulose, dextran and the like. , Other illustratiyq delay ed-rel ease carriers 
include supramolecular Tbiovectors, which comprise a non-liquid hydrophilic core (e.g., 
5 a cross-linked , polysaccharide < oro oligosaccharide^ and^. optionally,, an. external layer 
, comprising an-amphiphiJtic compound,; sucfy as a phospholipid (see e.g., y.S, Patent No. 
5,1 5 1 ,254 apd PCT applications WQ 94/2Q078 5f W9#4/237(U and WO %/06638). The 
? amount of active compound contained >^tlun a sustained depends 
t upon : the site of implantation the r^eai^d expected duration of j-ejeas^and the nature of 

10 the condition to «,•«.. ; v. * . i -. V i; 1 . - ;i 

><;■ u hi d In another illustrative; ejnbp$me$t, J^iptfegradabte , pd^pjspheres (e.g., 
: polylactate, potyglycolateX are jsn^ioyed as.iearriers,, for the compositions of this 
- invention. Suitable, biodegradable, microspheres are - disclosed, .fo^exainple^in U.S. 
i?am 4,897,268; .5j07£,14&-->5,928,$47; ' 5*81 l 4 ttfc : &*2fl,8?3;,. 5,853,763; 
lj5, v ^814344^^ g core protein carrier systems. 

: „r such as^escribed^niWtQ/^' 40^34, and : references cited therein, w$ alsojbe useful for 
,v ; / many #ppljcations. 80 : Ai?ojthfirr illv^^^^erl^Uy&ry . e@0W$ ^ c . arrier 

- comprising p^culate-protein, oomple^es, -§uch • as those de^ribed ! in ? iU^^ No. 
5,928,^47, which arej qapab.le,. of inducing a class. I-restricted cytotoxic T lymphocyte 

20 responses inal^st.r rM : ... . - ^ -r b ; , m 

i In anothef. illustrative, embodimea^calciumiphosphate core particles are 
7 employed as, farriers, vaccine; adjuvants, /or ap controlled release matrices for the 

compositions of this invention. Exemplar calpiym phosphate particles are disclosed, 

for example, ir\ published patent application No. WO/00461 47. , , 

25 t . The pharmaceutical compositions of the, invention will often further 

comprise one . or ; more buffers (e.g., neutral ^buffered, -saline or phppphate buffered 
, r s^li^e), carbohydrates ; (^g., ^i^ose^jrnannose, sucrose or dextrans), mannitol, proteins, 
,, , polypeptide^oor aigino acids such glycine, antioxidants, bacteripstats, chelating 

ageiits.such as EDTA or glutatjiiqme r a$uy^t%(e.g,, aluminum hydroxide), solutes that 
30. render the formulation isotonic, hypotonic ,or t weakly ^hypertonic with the blopd of a 
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recipient, suspending agents, thickening, agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

The, phairnaceutical compositions described herein may be presented in 
unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
5 are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. In general, formulations may be stored as suspensions, solutions 
or erpulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 
carrier immediately prior to use. 
10 The development of suitable dosing and treatment regimens for using the 

partiqulax ^ compositions described herein in f a variety of treatment regimens, including 
: £?.#, oral, parenteral, intravenous, intranasal, and . intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 
general purposes of illustration. 

In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered v/a oral administotion to an ^riimal. A^ such, these compositions 
may be foimulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directly ydth the food of the diet. 
20 4 The active compounds may even be incorporated with excipients and 

, used in the form of ingestible tablets, bucqal tables^ troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et aL, Nature 
. .1997 Mar 27;386(6623):410-4; Hwang a/., Crit Rev Ther Drug Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
25 5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, 
apacia, cornstarch, or gelatin; e^cipients ? such as dicalcium phosphate; a disintegrating 
, ; agent, sutch as corn starch, pqtatp starch, alginic acid and the like; a lubricant, such as 
magnesium, stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
30 be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
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materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
ihateriai used in preparing any dosage unit form should be pharmaceutically pure and 



substantially non-toxic in the amounts employed. In addition, the active compounds 



may be incorporated into sustained-release preparation and formulations. 



Typically, these formulations will contain at least" about 0.1% of the 
active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 



1 0 70% or more of the weight or volume of the total formulation. Isfaturally, the amount of 



15 



'■■*noi'h; i c<ff 



active compound(s) i in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained, in any given miit dose of the compound. 
Factors such as solubility, bioavailabilityrbiological half-life, route of administration, 
product shel)T life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 
variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form 01 a mouthwasn, 
dentifrice, buccal tablet/ bral spray, or sublingual ordly^dmimstered formulation. 
20 Alternatively, the active ingredient may be incorporated into an oral solution such as 
one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 
7 dentiJfrice, or added in a therapeutically-effective amount to a composition that may 



25 



include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

1 In certain circumstances it will be desirable' to deliver the pharmaceutical 

compositions disclosed herein pareriterally \ intravenoxisly , ^intramuscularly, or even 
iritraperitoneally! Such approaches are well known toiie skilled artisan," sbnie of which 
' ! are further described; for exkrnp'le, in ? U: S^t^ki ZMsfczW^. Patent 5,641,515 
30 and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 
' ' as free base or pharmacoiogicaliy acceptable salts may be prepared in water suitably 
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mixed with a surfactant, , such as hydroxypropylcellulose. i ■ Dispersions may also be 
prepared in glycerol, liquid polyethylene ^glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations generally will contain 
a preservative to prevent the growth of microorganisms. . ; , . . 
5 ..; . <- : ^ Illustrative pharmaceutical rforms suitably for injectable use include 

sterile . aqueous solutions or dispersions j and sterile powders for the extemporaneous 
preparation of sterile injectable solutiohs or dispersions (for example, see U. S. Patent 
5^466,468); , In all cases the form .must be sterile and* must be fluid to the extent that 
easy syringability exists. It must be stable | under: the rconditions of manufacture and 
10 storage ; and ^must be ^preserved against the: contaminating action; of mieroorgarjrisms, 
si such as bacteria and fungi. JThe; carrier can be a, solvent or dispersion medium 
/y; containing, for e&ample,; water, ethanol, polyol fag., glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like):, suitable mixtxjres . thereof, arid/or vegetable 

4 

oils. * Proper fluidity may.be maintained,* for example^ by the use of a coating, such as 
1 5 lecithin, by the maintenance of the required particles size^in thecase of dispersion and/or 
by the use of surfactants. The prevention <of the actioncHf microorganisms can be 
: ^facilitated by ; various .antibacte^ example, parabens, 

<fi'< chlorobxitanolj ph&iol, sorbic ^cid, thimerosal,* and the like. In many cases, it will be 
j preferable to include isotonic; agents** for* example, sugars or sodium chloride. 
20 Prolonged absorption of the injectable compositions can be brought about by the, use in 
the compositions of agents delaying absorption^ for example, aluminum monostearate 
> and gelatin-: t i v ;, : > 

^ >' In one embodiment, for parenteral administration in an aqueous solution, 

the solution should be suitably buffered if necessary and the liquid diluent first rendered 
25 isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable, for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection a sterile aqueous medium that can be employed will 
be known to those bf skill in the art in light ;of*the present disclosure. For example, one 
dosage may be dissolved in 1 ml of isotonic NaCl solution arid either added to 1000 ml 
30 of hypodermoclysis fluid or ihjeteted at thoproposed site of infusion, (see for example, 
"Remington's; Pharmaceuticals Sciences" 15th Edition, pages 1035-1038 and 1570- 
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1580). Some variatiori4h dosage will necessarily occur depending on the condition of 
the subject being treated. * Moreover, 1 for human administration, preparations will of 
- course preferably -meet sterility/ pyrogenicity, and . the general safety 'and purity 
standards as required by FDA Office of Biologies ^tafidards. - - 
5 - in another embodiment of the -invention/ the compositions disclosed 

- therein may be formulated in u neutral or salt form. ^ Illustrative 
i f. pharmaceuticaliy-^cceptable salts: include ithe acid addition salts (formed, with the free 

amino groups of the/ protein) and \Vhichi are r formed with -inorganic &cids: such as, for 
^xamplei, hydrochloric or phosphdric adid$> or . <such ^organic: acids as acetic, oxalic, 
10 x tart&ic, maridelic,,and ffhe-like^ Salts foiled with therfrefe carboxyl groups can also be 
•.,:/• : derived iftom inofgante bases > such asy for example, sodium,^potdssium, :ammoiiixim, 
r r calcium; or ! ferric hydroxides,; ,and~ i'such organic abases' as isopropylamine, 
)c trimethylamin^, Wstidine, procaitieCand the like. Upon fornhulation, ablutions \yill be 
o administered in a mdnnej^ compatible with the dosage" formulation' and in such amount 
15 r i^as is therapeutically effective ^ ^ ^ ■ - 1 ' 

so ?.mz nsprtv-.rir ■ The e^eisifc^furth^ 

- * 1 vehicles, coatingsr^iluetttsi,taii^teacf0riBl ^ahd santifuogal agents, x isotbiiHQ ;ihd; absorption 
■v delaying agents, buffers, carriet solutions, suspensions^ 'colloids, >an6Ltfae dike. The use 

of such media attd agents for pharmaceutical Lactiv© substances is; well known in tlie art. 
20 Except higofar as any conventional media or agent is incompatible .with the * active 
ingredient, its use in the therapeutic compositions is contemplated. Supfplementary 
active ingredients can also be incorporated into the compositions, The phrase 
"pharmaeeutically-acceptable ,t refers to molecular entities and compositions that do not 
r produce an allergic or similar untoward reaction when administered to a human. 
25 : ; - In certain embodiments, the -pharmaceutical Compositions may be 
* delivered by intranasal sprays, inhalation,; i and/or other aerosol delivery vehicles. 
Methods for delivering genes^ nucleic/acids, and . peptide compositions . directly to the 
• lungs v/a nasal aerosol sprays ihas been described; ^g. / in U^S.<Patent 5,756,353 and U. 
7 > S;- Patent 5,804,212. Likewise, ^the delivery, of -drugs? using intranasal, microparticle 
30 resins. (Takehaga* et:Mh y ^< ^Controlled* < JElelease;,, 1998. -Mar * 2;52(l-2):81-7) and 
* - lysophospMtidyl^glycerol compounds (U. ^ Patent 5,725,871) are; also well-known in 
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1 the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form 
of a polytetrafluoroetheylene support matrix is described iiiU. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanoc&psules, microparticles, lipid 
particles, vesicle^ and the life, are used for the introduction of the compositions of the 
5 present invention into suitable host cells/organisms. In particular; the compositions of 
the present invention 1 may be fdhnulated for delivery eithbr encapsulated in a lipid 
particle; a liposome, a vesicte, a riariosphere, or a haiioparticle or the like. 
Alternatively, dompositions of thfe present iriv^ntidn can be bound, either covalently or 
non-covalently, to the surface of such carrier vehicles. 
10 The formation and use of lipoSonie and lipois6ihe4ike preparations as 

- potential drug carrier^ is generally kfioSvri td i those^oiF skill ; ii! the art (see for example, 
! 1 LasSc, " Tt&tite ; BiotecKnol 1 998 Jul; ! l S(7):307-2 1 ; Takakura; v N^pon Rinsho 1 998 
' " fti*ir?56(3):691*-5; Chaodran W'U,*fiidian PExp Biol. ? 1 997 Aug;35(8):801-9; Margalit, 
Crit Rev Ther Drug Carrier SySt.' 1 995 ; 1 2(2-3):233-61 ; U.S. -Patent 5,567,434; U.S. 
1 5 Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,73 p68 and U.S. Patent 
5,795,587, each specifically incorpor'kedn^ ' 

r ' J ' n ' li^o^omes hav^ fefcn ii^ed ^^ucceissfully with a number of cell types that 
are normally difficult to transfect by other procedures, including T cell suspensions, 
primary tiepatdcyte ciiltures arid' PC 12 cells (Renneiseh et al, J Biol Chem. 1990 Sep 
20 r t !25;265(27)i 1633742; Mulleref 'a/., DNA Cell Bitil. 1990 Apr;9(3):221-9). In addition, 
liposomes are frefe of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have" been used effectively to introducie genes, various drugs, 

r 

radiotherapeiitic agents, enzymfes, viruses, transcription factors, allosteric effectors and 

the like, into a variety of cultured cell lines ^aiid animals. Furthermore, he use of 
25 liposomes does hot appear tci be associated with autoimmune responses or unacceptable 

toxicity iafter systemifc cieliveiy. 

In certain embodiments, lipbsolries are formed from phospholipids that 

are dispersed in an aqueous medium and spontaneously form 1 multilamellar concentric 

tiiayer Vesicles (also tern 
30 AltesAatively, ih other eihbodimerits, the iriveritiotf = provides for 

phiarmaceutically-acceptable nanocapsule formulatibns of the 1 compositibns of the 
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^present invention.. T^anocapsuJes can generally entrap compounds in a stable and 
reproducible >yay (see, for example, Quintanar-Guerrero et al, Drug Dev Ind Pharm. 
1998 Dec;24(12):l 113-28). To avoid, side effects diie to intracellular polymeric 
overloading,, such ultrafine particles (sized argund. 0.1 .urn} may, be designed using 
■ ; 5; polymers able to.be. degraded in yivo. .Such particles can be ma.de as. r described, for 
.: .,v s r example, by Cou-vreur et al , Crit Rev Thejr Drug (fang gyp.. . 1 988;5Q):1 -20; zur 
Muhlene/ a/., ; Eur J Phann^Biopharm. 19^8..,M?T»45^):149r55^ Zambaux et al. J 
.-■a v * ! / : Controlled Release^- 1 9-98, Jan 2;50(1-3):31 ( -4P; and y.tS.Patent 5,145,684, •. 

; , , Cancer Thej^ejjtjc Metijqps v . br; ., ..... . . ;! •■ ; 

: '40 , .-- - .:■ -r, JrpmunplQgic apr^paches .to cancer ^*agy:.^i», bassd .^n..^ ) recognition 
\ 1 ... f .that cancer cells s car* often cyade the body's defenses against aberrant or fqreign cells 
, ; , and mplpcules, jand %t tfcese, ^efensss xnight^bq.thei^peutipajlyj stimul^tecMq regain the 
i r > lost groun^>e.g. pgs. 623-64,8 ip Kleiji, Immunology (^ilfy-Interscience, New York, 
1982). Numerous recent observations that various immune effectors .can directiy or 
15 indirectly ,i|^it^g^,9f.|U|ifOr§ ( .h3§ Jed tq renewed interest in this^ approach to cancer 
T v ; ,:,t , r . , therapy^e.g. u Jager, :et a^, Oncology, .200 1;6Q(1 ) : 1 -T, Pseimer;, et al., Ann Hematol 2000 

• ■•, • Pec;79(12):65H. ,,,, , . ;; ^ - ^ , . , :> , -^^ 
. ( ....... < Four-basic qell types whose function l{ ha§ been .associated with antitumor 

b cell immunity and the elimination of tumor cells, .from the body ?u*e: i) B-lymphocytes 
20 which secrete immunoglobulins into the , blopd plasma for identifying and labeling the 
jionself invader cells; ii) monocytes which secret^ the complement proteins that are 
responsible for lysing, and processing. the imipunoglobulin-coated garget invader cells; 
iii) natural killer lymphocytes having two mechanisms for the destruction of tumor 
cells, antibody-dependent cellular cytotoxicity and r natural killing; and iy) T- 
25 lymphocytes possessing antigen-specific receptors and having the capacity, to recognize 
a tumor cell carrying complementary marker molecules (Schreiber, H., 1989, in 
f ; Fmdamjept^ Inrniunplogy^ r : , ^ f 

Cancer immunotherapy generally focuses on inducing hump?;al immune 
. -.. responses, cellular immune responses, or berth. MOTeoyer^ it is well established that 
30 . .induction of GD^ + T helper .cells,, is necessary in order to, secondarily induce either 
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antibodies or cytotoxic CD8 ? T cells. Polypeptide antigens that are selective or ideally 
specific for cancer cells, particularly colon cancer cells* offer a powerful approach for 
inducing immune responses against colon cancer, and are an important aspect of the 
1 present invention; - : ^ ^ ^ 

5 ' t " Therefore^ in further aspects of the present- invention, the pharmaceutical 

compositions' described hferein may be vised to stimulate an immune response against 
- cancer, particularly for the immunotherapy of colon cancer. Within such methods, the 
{ pharmaceutical 6oilipbsitibns ,x describeH Mreiri are administered to a patient, typically a 
- f warm-blooded animal, preferably a'human. ^ipatient may or may not be afflicted with 
10 ; ; cancer. ^ Phahhiaceutiic^ ctim^ositions iarid' vaccines may be administered either prior to 
1 6r * ^Kdwii^g surgical removal of ' rpriMatj^ ^tumors • * and/or- •/ 'treatment such as 
administration of radiotherapy of conveittional chemotherapeutic drugs! r As discussed 
r above, administration of the pharmaceutical compositions ' may be by any suitable 
method, including administration by intravenous^ intraperitoneal; intramuscular, 
15 subcutaneous, ihtrariiasali intradermal, anal, vaginal, topical afod oral routes. . : f 
; v, * : ^ Within * certain embodimb&ts^ ^ ^ ^ithinunotherapy may be active 
?i iniWunbffi^ whifeh treatment rdi^s r oh tiie m vrvd stimulation of the endogenous 

( host immune' 1 system to react -against turiiors with the ^administration of immune 
response-modify ihg agents (such as' polypeptides and polynucleotides as provided 
20 herein): ' ; 1 - J : v 

^ v Within- other embodiments, immunotherapy may be passive 
immunotherapy, in which treatment involves -the delivery of ' agents with 5 established 
^tximor-immurie reactivity (such *as effectcfr cells or antibodites) that can directly or 
indirectly mediate antitumor effects and does not necessarily depend on an intact host 
25 immune system. 1 : Examples of r effectdr cells include T cell^ as discussed above, T 

lymphocytes (such ais GD8 + cytotoxic T lymphocytes and GD4 + T-helper tumor- 

■ *"i . ...... * 

mfilfeatrng lymphocytes)^ killer cells (such as Natural Killer cells and lymphokine- 
activated killer 6ells) 1 , B> cells atnd antigfen-presentihig cells (such as dendritic cells and 
1 niacmphajges) expressing a polypeptide provided herein. T cell receptors and antibody 
30 receptors specific for the polypeptides recited herein may be cloned, expressed and 
transferred into other vectors or effector cells for adoptive immunotherapy. The 
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polypeptides; provided herein may also be used to generate antibodies or anti-idiotypic 
( antibodies (as .described above and in U.S. -Patent No. 4,918, J64) for passive 

immunotherapy, f s . : . .jr.. 

Monoclonal antibodies may be labeled with anyjof a yariety of labels for 
5. desired selective, usages in defection^ diagnostic assays or therapeutic applications (as 
described . in • VS. Patent - Nos>- £ 9 Q90,3j6M ,-^01.5,542?. • •5 i 845,39«; 5,5.95,721; and 
4,708,930, hereby incorporated byrtefermps in jtxeir entirety a? if .each was incorporated 
; , i individually). In eacb'cas^ the ,bw#ng of jthe labelled mcRipplppal antibody to the 
(• • v>- ? iidetermina^t i site of ; theMantigen ^U^-isignat^jleteiCtioii.^ .• de^very ,of -a particular 
r .10., therapeutic agent to the antigenic determinant on the n^n-norn^ ©elL A fi^rther- object 
; of this > invention^ is to > provide ; tfie> epseifie monoclonal anybody suitably labelled for 
c : j .: .achieving svfeh desir^ selefetiN^ t us^es thereof. r Ui Klv \ ^ . i-Mr^x 

. : ; Effector pells moty generally^ be; iobjained^ in; suffiqient quantities for 

^ u s adoptive immunotherapy by growth inyitrq, as despfib§d herein.^ Culture conditions for 
15 expanding; i single ^aiitigen-specjfip effector . cells ; ,to several; billion in niimber with 
3>-i)o.*5 u retention of antigen xs^gnitio^ well : known in th^ art. Such in vitro culture 

;>:\ i: ^iv&ii conditions typic£dly wse inteifr jt|en$ ; Stimulation wijhjantigei^^pft^ij ^ presence of 
,;<\;a i cytokines . (such as ,ib-2) and] non Tj divi4ing feeder cflls, . As Rioted above, 
. . immunorsactive polypeptides a$ proyidetf c j^rein L may be -;u?ed to rapidly expand 

20 antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cellSj s^ph as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulse<J with r immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
i in the - art. For example, antigen-presentjlng ^ells c^n ^b^ transfepted with a 

25 polynucleotide having a promoter ; appropriate ; for increasing expression - in a 
recombinant virus or other sxpressionsystein. Cultyred effectpr cells for u?e in therapy 
1 \ . must be able to grow and distrib^ Studies 
: ; :;U;v haye shown? ^that cultured effector cells can bevjndnepd ^Jp : gfow m viyo. and to survive 
long term in substantial pumpers by ^frepeatpd stimiilation with antigen supplemented 
/, 30. :< with lLy2%see, for exapiple, Cheever : et al., Imr^tunoJogi^l Reyiews 157: 1 77, 1 997). 
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Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells takeft from a patient and clonally propagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 
! intravenous, intracavitary, intraperitoneal or intratumor administration. 

Roiites and frequency of administration of - tHe ther^ 



described herein, as well as dosage, will vary froth individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may d6 administered by injfebtion (e;jf:, intracutaneous, 
10 intramuscular, intravenous or subcutaneous), ihtrahaLsally (^^.^ by aspiration) or orally. 
Preferably, between"! and 10 doses may ' be kdihimstdred oVer : a 521 week period. 



"Preferably, 6 doses are administered, kt intervals of 1 month^ aiid booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 

■ 1 - ; . 1 • ■ ...... 

individual patients. A suitable dose is an amount of a compound that, when 
15 administered as described above, is capable of prbihbtih'g 'ah v ahti-tumor immune 
response, and is at least 10-50% above 1 the l>asai (/ Untfeaied) 5 level.' Siich fe'sponse 
can be monitored by measuring the anti-ttimor Mtibodifes' m a patient or by vaccme- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 
cells in vitro. Such vaccines should also be capable Of causing an immune response that 
20 leads to an improveci clinical outcome {eg., more frequent remission^' complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients.' In general, for pharmaceutical compositions arid vaccines 
comprising one or more polypeptides, the amount of each polypeptide present in a dose 



! j 



ranges from about 25 fig to 5 mg per kg of host. Suitable dose sizes will vary with the 
25 size of the patient, but will typically range from about 0.1 mL to about 5 mL. 



In general, 1 an appropriate dosage and treatment regimen provides the 
active compound(s) in an ambuhf sufficient to provide therapeutic and/or prophylactic 
benefit. Such a response can be monitored by establishing an improved clinical 



outcome (e.g., more frequent remissions, complete or partial, of longer disease-free 
30 survival) in treated patients as compared to non-treated ^ patients. 4 Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
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clinical outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays., which may be performed using samples 
obtained from a patient before and after treatment 



All 



10 



Cancer Detection and Diagnostic Compositions, Methods and Kits 
5 In general, , a cancer n^ay be detected in a patient based on the presence 

of one or more colon tumor proteins and/or polynucleotides encoding such proteins in a 
biological sample (for example,, bloo<i, c sera, sputum urine and/or tumor biopsies) 
:,v- 0i obtame&,^^ I^. other words, such proteins may be used as markers to 

indicate the presence or absence of a cancer such as colon cancer. In addition, such 
proteins may be useful for the detection of other cancjers. The binding agents provided 
herein generally permit detection of the level of antigen that binds to the agent in the 

- biological sample. , , , 

^ t, f|j . (V i Polynucleotide primers and probes may be used tp detect the level of 
jpaRNA encoding a Juipor protein, which is also indicative of th$ presence or absence of 
nfsr , 15, j -a cancer. In general, ajumor sequence ^hould be; present at a level that is at least two- 
, fold, preferably three-fold, and more preferably five-fold or higher in tumor tissue than 
in normal tissue of the same, type from which the tumor arose. Expression levels of a 
particular tumor ; sequence in tissue types, different from Jhat in which the tumor arose 
are irrelevant in certain diagnostic embodiments since the presence of tumor cells can 
20 be confirmed by observation of predetermined differential expression levels, e.g., 2- 
fold, 5-fold, etc, in tumor tissue to expression levels in normal tissue of the same type. 

Other differential expression patterns can be utilized advantageously for 
diagnostic purposes. For example, in one aspect of the. invention, overexpression of a 
tumor sequence in tumor tissue and normal tissue of the same type, but not in other 
normal tissue types, e.g. PBMCs, can be exploited diagnostically. In this case, the 

■ J " 1 ' ' • 1 ■ - ' 1 1 , V f. ,' J I. 

presence of metastatic tumor cells, for example in a sample taken from the circulation 
or some other tissue site different from that in which the tumor arose,, can be identified 



25 



anchor confirmed by detecting expression of the tumor sequence in the sample, for 

■ ■* * 1 " ^ * ****** * * * * ■ * . i * 

.* * 

example using RT-PCR analysis. In many instances, it will be desired to enrich for 



■ it - ■ 
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tumor cells in the sample of interest, e.g., PBMCs, using cell capture or other like 
techniques. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 

■ > ^ - : ■ j. f • -.,.,-....( ... , > , - - ■. . v | ; v. ■ r .- • i . 

Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. In general, the presence or absence of a cancer in a patient may be determined by 



(a) contacting a biological sample obtained from a patient with a binding agent; (b) 



detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-off value. 



10 In a preferred embodiment, the assay involves the use of binding agent 

iiiunobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a 
detection reagent that contains a reporter group and specifically binds to the binding 
agent/polypeptide complex. Such detection reagents may comprise, for example, a 

1 5 binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an antiimmunoglobulin, protein G, 
protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
binding agent after incubation of the binding agent with the sample. The extent to 

20 which components of the sample inhibit the binding of the labeled polypeptide to the 

j 

binding agent is indicative of the reactivity of the sample with the immobilized binding 
agent. Suitable polypeptides for use within such assays include full length colon tumor 



1 1 



proteins and polypeptide portions thereof to which the binding agent binds, as described 
above. 



25 The solid support may be any material known to those of ordinary skill 

in the art to which the tumor protein may be attached. For example, the solid support 



1 ! 



may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
4 Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyyinylchloride. The support may also be a 
30 magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 
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. ? using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. 1 In the context of the present invention 
, the term "immobilization" refers to both noncovalent association, such as adsorption' 
< ^ covalent attachment (which may be a direct linkage between the agent and' 
5 functional groups on the support or may be a linkage by waf of a cross-linking agent) 
Immobilization by adsorption to a well in a microtiter pkte or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting ihebmding agent 
in a suitable buffer, with the solid support for a suitable amount of time. 'The contact 
time varies with temperature, but is typically between about l hour and about 1 day In 
10. gen^l, «^ ^ 

polyvm^chloride) with an amount of binding agent ranging from about 10 ng to about 
10 Mg, and preferably about 100 ng to about 1 ug, is sufficient to innnobilize an 
adequate amount of binding agent. ; ' T * ■ - 



( Covalent attachment of binding agent to a solid Support may generally 
be achieved by first reacting the support with a Afunctional re^ent'tnat will react wim 
O r.™,,^ ^ ^.^ I S™P> nuch as a hydroxyl or amino group, on the 

,^, Iv p b ^ a f ent ; ^f^Plf, binding^gent may be covaiently attached to supports 
, ; , ^ Vmg m a PP r °P riate Polymer' coating using benz^quinon^ or by^en^tion of an 
7 , } - , ald6h ? & ° UP °» the ! SU PP° rt - amWand an active nydr^en^me binding 

/ ..v 2 °. , partner e f • Pierce Irnmunotechnology Catalog and handbook; 1991 'at 

A12-A13). : '< ' ' 

c 



15 
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In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
on a solid support, commonly the well of a microtiter pk£ with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 
Unbound sample is then'removed from the immobiiized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable 'of binding to 
a different site on the polypeptide) c<>niaini4 ar^'g^i^ek! The'amountof 
detection reagent that remains bound to the solid support Is thfen determined using a 
30 method appropriate for the specific reporter g^up: q _ ^ ' L; .. 



• u ' : ' i! '' "■ '- ! : :- ■ ■ ■■»: ; ' . ■.>?; 
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More ; specifically, once the antibody ,is irnmpbilized on the support as 
described above, the remaining protein binding sites , on the, support are typically 
blocked. Any suitable blocking, agent, known to those of ordinary skill.in the art, such 
as bovine ,serum albumin or Tween 20- (Sigma Chemical Co., St. Louis, MO). The 
5 immobilized antibody is then incubated with the sampled polypeptide is allowed to 
. ... bind to the antibody. The , sample may, be diluted, with actable diluent, such as 
; - phosphate-buffered, saline (PBS);pripr to incubation., ; In general, an appropriate contact 
; .. .-time (U, incubation time) is a period ^time that is sufficient to detect the presence of 
. polypeptide within a sample obtained, from an indiyidual with *olon cancer at least 
7 l0 about ,95% of that achieyed at ecjurjibrium b^n bopui a^.unbou^.polypeptide. 
, \. Those of ordinary .skilljn tbe.art^Ul recognize, that the-time necessary to achieve 
equilibrium may be readily deterged by assaying the level of binding that occurs over 
.a.-period pf time. At room temperature, an incubation. time of about 30 minutes is 

-generally suffopient. f - ^ ■ ; 

15 . . , .Unbound sample may to 

' ,,' , with an appropriated, such ,as P B S^mng,Q,l% .Tween. 2()T^The second 
; , m ^;^,c^^n V ^^^ Wi then be. added, to. the- solid support. 

,,. ffreferredwpartf* groupsiinclu4e fhos^grqups t&m 

t Th e detection reagent is then, incubated, with the immobilized antibody- 

polypeptide complex for an amount of time sufficient*) detect the bound polypeptide. 
An appropriate amount of time .may .generally be determined by assaying the level of 
binding-that occurs, over a period of time., Unbound detection reagent is then removed 
and bound . detection, reagent is detected using the reporter group. The method 
employed for detecting. the reporter, group depends, upon the nature of the reporter 
group. For radioactive groups, scintillation counting or autoradiographic methods are 
■ generally appropriate. , Spectroscopic methods may be used to detect dyes, luminescent 
groups, and- fluorescent, .groups,, Biotin. may be detected using avidin, coupled to a 
u ... different reporter .group (commonly a radioactive or fluorescent group.or an enzyme). 

* Enzyme repPrter. group, ,m,y .generally be , detected by ■ the addition, of substrate 
30 .(generally for a specific period of t ime)>; followed by spectroscopic or other, analyse of 
•.-theieadfion'produiGts, . • • i f 



20 



v25 



WOOI/96390 



PCT7US01/18577 

102 



• - l To determine the presence or absence of a cancer, such as colon cancer, 
v ' the signal ^detected from the reporter group mat remains bound to the solid support is 
generally compared to a : signal that corresponds' to & predetermined cut-off value. In 
'one preferred embodiment, the cut-off value for the detection of a cancer Is the average 
5 <* mean' signal Obtained when the immobiIi2iea antibody is intubated with samples from 
"patients without the cancer. In general;' a sample generating a> 'signal that is three 
standard deviations above the predetermihett' cut-off valuers 1 considered positive for the 
: cancer. In an alternate preferred embodiments the cut-off value is determined using a 
:! "R&ema ,: <)§etkttfr ,: >Cilfr#, according' to the 'method of Sackett -et^ al:, Clinical 
m - 'Epidemidlogy: A Btisic ^Science for 1 iDlimtW'MedidiHei Lirfle'firbWn and Co.,' 1985, 
- 3 p. \ 06-7. Biieflyi ; in this embodiment, the cut-off Value may -be determined from a plot 
of pairs df^tftie positive' rates ; (i.e., sehs!tivity) and false ' positive' 1 *ates (100%- 
'•' specificity) 1 that correspond to each possible cut-off value for the.diagnostic-test result^ 

The cut-off value on the plot that is the closest to the upper Jeftvhand corner (i.e., the 
15 value that encloses' the largest area) is the rriost Accurate r cut-off value, and a sample; 
,? fe<Mera&g*s4gnaf tHatag^highe*-tHati f the* Mii&d&ermrtieA by -this Method may 
' ; tfe' considered'^ may be ; 'shifted *b ; 1flie left along?? 

the plot, to minimize thd"^^--^^©^--^ the --right,-** mimrfiize the false? 
negative rate.; In general; a sample generating a signal that is higher than the cut-off 

r - ' f 

'.0 value determmed by mis m , . r - 

In a related embodiment, the assay is performed in a flow-through or 
1 strip test format; wherein the binding argent -is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test; polypeptides- within the . sample bind to the 
immobilized binding agent as the sample passes through > the membrane, i A second, 
5 labeled binding agent then binds to the binding agent-polypeptide complex as a solution 
1 containing the second binding >ageinf ffofts through' the membrane. The detection of 
' bound second binding agent may then be performed as described above; vln the strip test 
format, one end of the membrane' to Vvhfch binding agent ;is bound ;is immersed in a 
'solution -"containing the sample.c' The sample migrates along, the membrane through a 
10 ^region containing seqond' binding agent and to the area of immobilized. binding agent. 
Concentration of second binding agent at the area of immobilized antibody indicates the 
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presence of a cancer. Typically, the concentration of second binding agent at that site 
1 generates a pattern, such as a line, that can be read 1 visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 
! 5 biological sample contains a level 1 of polypeptide that would be sufficient to generate a 
positive 1 signal in the two-antibctfy sandwich assay; in the format discussed above. 
■ ' Preferred binding agents for use' in such assays are antibodies and antigen-binding 
* fragnients thereof. Preferably, the 1 arnbunt of- antibody immobilized <on tfae membrane 
vt ranges from about 25 tig to about 1 p$ and more pfe^erably r fr6m about 50 ng to about 
1 0 : 5'ob ng. Such tests can typically be 'performed with a Very small amount of biological 
sample. 

Of course, numerous other assay protocols exSf mat are suitable for use 
1 ^ ; wkh' the tumor proteins -or binding agents" of the present invention. The above 
; ■'■ descriptions are' intended to be exeniplary only. 1 For example/ it will be apparent to 
15 r those of ordinary skill in the art that the abdve protocols may be reaBilymbdified to use 
1 tumor polypeptides to detect antibodies r that bifid to l su&i polypeptides in a biological 
• sample; ThVdetectton of such tumot pi^te^ correlate with the 

presence of a cancer. 

.? ' ;f ' • r : ' -A'caneer may 'also, or alternatively 5 , berd'etected based on the presence'of 

20 ' T cells that specifically react with a tumor protein in a biological sample. Within 
-certain methJdsl a biological' sample comprising CD4 + and/or CD8 + T cells isolated 
" from 'a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 
' polypeptide 'and/or an APC that expresses at least an immunogenic portion of such a 
' ! polypeptide, aiid the presence or absefice of specific activation of the T cells is detected. 
25 v Suitable biological 1 samples include, but are not limited to, isolated T cells. For 
' example; T cells may be isolated from a patient by routine 5 techniques (such as by 
FieollTHypaque density gradient centrifugatioh of peripheral blood' lymphocytes). T 
1 ' cells inay oe incubated in titrb for 2^9 days (typically 4 days) at 37 6 C wmV polypeptide 
(e g^ 5 _ 25 (ig/ml). It may be desirable to incubate Another aliquot of a T cell sample 
30 in the absence of tumor polypeptide to serve as a control. For CD4 + T ceils, activation 
is preferably detected by evaluating proliferation of the T cells. For CD8 + T cells, 
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activation, isi preferably detected by evaluating y cytolytic activity. A level of 
. . : . . proliferation that is aj le?ast two fold greater and^or a levej , of cytolytic activity that is at 
; ; least 20% greater, than in disease-free patients indicates the presence of a cancer in the 
- patient. t , , ;/ , ; - .. ; , ,^ . ., 

5 5 : r . •( As jipted #bove, a pancer may .sJsp, qr alternatively,! be detected based on 

■ v the level of i^RNA encoding a tpmor protein in a biological; sample,. For example, at 
, ( least two oligpnupleqtjde printers ma^ b^ ^employjed ip & pqlymerase chain reaction 
v } (PCR) base4 ass^y tq : a^np^fy^a pqrjtiqn qf a tupiot cDN^ : derived from a biological 
' i, i ■ sample, > wherdn , at. le^st^pnq of > the, pljgqnuclep^er primprs is, specific for (i.e., 
I AO hybrjidiz^s tp) ; a p0lynuqle9ti.de, .enpqdin^ th^ tippor r protein. The amplified cE)NA is 
then separated and detected using techniques well known in ,the rart, such as gel 
„ , - ^ e^ctropfeor^si^. t , t ; .. ;i j0 , T _ ..... ; ;J -» 0 

, >; Sijm}arly iT hybridize to a 

, y , ^ : polynucleotide er^qqdiiig a tumor protein may be used in a hybridization assay to detect 
L ii vij 1? tfy^ prqs^nce of polynupkotide encoding t the tujaiqr protein in a biological sample.; , f 
^nigok.W ;: t i$ [y : s ^ oHgonupleptide primers 

u :i;> .;/and P ro ^ e ? sh ?y!^ at, Jeast .about 60%, 

preferably at least about 75% and more preferably at JeastAbo.ut $0S£,Jdentity to a 
, portion of a polynncleotide encoding a tumor protean of, the /invention that is at least 10 
20 nucleotides, and preferably at least 2,0 nupleptide^, in length. Preferably, 
* oligonucleotide primer^ and/or .probes hybridize to a polynucleotide encoding a 

polypeptide described herein undpr moderately stringent conditions, as defined above. 
, Oligonucleptidje. primers .aqd/or probes which paay be usefuJJy employed in the 
... ri . diagnostic methods described herein preferably ^re at least 10-40 nucleotides in length. 
25 Ip a preferred embodiment, the oligonucleotide primejs cqmprisq at least 1Q contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as..disclqsed herein. Techniques for both PCR based assays and 
, hybridization assays.are yyelli known, ,m the art (see, for ^example, Muljis.et al., Cold 

<..■ , : . .. Spring Hm-hpr r Syinp-, QwmtrBW-* ^^63^9,87^ErUch «k.£<CK Technology, Stockton 
• , :30 Press, NY, 1939). , ., : . 
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One preferred assay ' employs RT-PCR, in which PCR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as kiopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least 6he specific primer generates a cDNA molecule, 

5 which may be separated and visualized using, for example, gel electrophoresis. 
Amplification may be performed oh' biological samples taken from a test patient and 
from ah individual who is not afi&cted with a cancel The amplification 'reaction may 
be performed on several dilutions of cDNA spanning two orders of magnitude. A two- 
' ? 1 fold or greater 'increase in expr^sib^^'seVaral dilution's of the test patient sample as 

10 compared to' the same dilutions of me noncancerous sample is typically considered 
positive. 

, v- 

■ if i^i - in^anbtriCT ^F* 601 of the present invention, cell capture technologies may 
used in conjunction^ with) for example, real-time PCR to' provide a more sensitive 
tool for detection of metastatic cells expressing colon tumor antigens. Detection of 
15 colon dancer ceils in biological samples, e.g.; bone marrow samples:, peripheral blood, 
" and small needle aspiration samples is desirable for' diagnosis ani! ^rognbsis in colon 
cancer patients. 

Immunomagnetic beads coated with specific monock>hal antibodies to 
surface cell riia&ersf or tetrameric antibody complexes, maty be used to first enrich or 
20 positively select cancer cells in a sample. Various commercially available kits may be 
; used, inciuding D^abeads® 8 Epithelial Enrich' (Dyhal Biotech; Oslo, Norway), 
StemSep™ (StemCell Technologies, Inc.; Vancouver, BC), and RosetteSep (StemCell 
Technologies). A skilled artisan will recognize that other methodologies and kits may 
also be used to enrich 6r positively select desired cell populations. Dynabeads® 
25 Epithelial Ehrich cbntains magnetic beads coated with mAbs specific for two 
glycoprotein membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a magnet, 
' thereby carituririg me cells bound to the beads. 'The unwanted cells are washed away 
' and me magnetically isblated 

30 ' ' ' RosetteSep can be used to enrich cells directly from a blood sample and 

consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells 
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and crosslink, them to glycophorin A on, red bjood cells (RBC) present in the sample, 
forming rosettes. When centrifuged oyer FicoH, targeted cells pellet along with the free 
RBC, The, cpmbination of anti]t>q#es in the depletion cocktail determines which cells 
w#l be removed and consequently which^cells will be recovered. Antibodies that are 
available include,, but are not limited to: .CE>2, CD3* CD4, CDS, CD8, CD10, CDllb, 
CD14, CD15, CD16, CD19, CD20, CD24, .CD25, CD29,,CD33, CD34, CD36, CD38, 
qD4J, CD45, CD45RA ? CD45RO, ,££^6, CD66B, CD66e, HLA-DR, IgE, and 
TCRciB. , . 



10 



role 



k - , i t Additi.9»ally, ; it is contemplated in the present invention that mAbs 
specific for colon tumor antigens can be generated and used in a similar manner. For 
example, mAbs that bind to tumor-specific cell surface antigens may be. conjugated to 
magnetic beads, or formulated in. a tetrameric antibody complex, and used to enrich or 
positively select metastatic colon tumor cells from a sample. Once a samplers enriched 
or positively selected ,,, cells may be lysed and RNA isolated. RNA may then be 
l ? i . S 'VW ect ^; to RT-PC^-^y?^ us j"S colon tumpr-specific primers, in a real-time BGR 
n i .^?. say ! T as de ,S c rft ed fef/»U>i Pn^ sWlled in ^he art will recognize that enriched or selected 
populations of cells may be analyzed by other methods (e.g. in situ hybridization^ or 
\- t flow C)rtometry). 

- 1. In ^^Pfj embodiment the compositions described herein may be used 

. as niarkers for the progression of cancer. In this embodiment, assays as described 
above for the diagnosis of a cancer may be ; performed oyer time, and the change in the 
leyel of reactive polypeptide(s) or polynucleotide(s) evaluated, For example, the assays 
may be performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 
performed as needed. In general, a cancer is progressing in those patients in whom the 
level, of polypeptide or polynucleotide detected increases over time. In contrast the 
cancer is not progressing when the level of reactive polypeptide or polynucleotide either 



20 



25 



remains constant or decreases with time. 



/■: ■.: , • r . - -i , Certain m v/vo diagnostic assays may be performed directly on a tumor. 
, 5 ..-Q n ^- s ^ ch r as ?ay involves contacting tum^tj cells Avith a binding agent. The bound 
- f : i b ff d ^ g a Sf^ ^ay JhenL be detected directly, or indirectly via a reporter group. Such 
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binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple tumor protein markers 
may be assayed' Within 1 a given sample. It will be apparent that binding agents specific 
5 for different proteins provided herein may be combined within a single assay. Further , 
multiple primers br ! prbbes may be used concurrently. The selection of tumor protein 
markers may be based ori routine experiments to determine combinations that results in 
i optimal 1 sensitivity. In addition, or alternatively, assays for tumor proteins provided 
' r ■herein may be ; cOmbined withassays for 'other known tumor antigens. 
10' The present invention further provides kits for use within any of the 

above diagnbstic methods. Such kits typically comprise two r or -more components 
' - "nTO&^ary 'for •forming a diagnostic assay.' 1 > 'Components may be compounds, 
reagents, containers ariaVor equipment*- For example, orie container within a kit may 
Contain a monoclonal antibody 'or fragirient thereof that Specifically binds to a tumor 
1 5 protein. Such antibodies dr fragments may be provided attached to' a support material, 
' ' as described above.' One or motfc additional cdntainers^iriay enclose elements, such as 
v.,-. reagents or buffers, 1 to be used in the assay?' Such kits iriay also, or alternatively, contain 
: ^ detebtidh reagbht as described '^bove that contains a reporter group suitable for direct 
' - ox mdirect detection of antibody binding. ' 1 

20 Alternatively, a. kit may be designed to detect the level of mRNA 

5 4 encoding a tumor protein in a biological , sample. 1 Such kits generally comprise at least 
one • oligonucleotide probe or primer, as described above, •■ that hybridizes to a 
polynucleotide encoding a tumor protein. ' Such an oligonucleotide may be used, for 
example, within a PCR or hybridisation assay. Additional components that may be 
25 present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to' facilitate the detection of a polynucleotide encoding a tumor protein. 

• / The following Exahipfes are offered by way of illustration' and not by 



way of limitation. •'' - '' ' io '"• ' ' >' ir 
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• ■■ , -<-<■.:, EXAMPLES , , . 

EXAMPLE 1 .. /, 

.._ .1, IDBN.TIFICAT70N OF COLpN TU^OR,P^pTEIN,CDNAS 

5 > 1 . This Example illustrates the. identification of cDNA molecules encoding 
colon tumor proteins using PCR-based pDltfA.s^biractipn jmetljodology. 

A ropdificajiqa.Qf % lf qionteg|». (P^o-A^i .CA) PCR-Select™ cDNA 
subtraction methodplpgy was templpyed to : obtain cDlSTA jjopulations. .enriched in 
cDNAs derived from, ; ttanscripts ? , ; th$ .are ^^ ^if^erentiajlly ., expressed in colon tumor 
1 0 .samples. By ^s methodplpgy, rnRJ^JA populations, were isolated from colon tumor and 
i metastatic, rumor samples ^este^ mRNA) as weU .as^frpm nprmal tissues, such as 
r . .brain, panpreas^.bpne^marrow, liyer, ( hea#, Jung, stomach and small, intestine ("driver" 
'■>■ •> mRNA K Frqm. thg.tester and dri ver f mRNA populations,, cDNA. was synthesized by 
,. -standard methpdplpgy. See, £ : g., AusubeL F.M..jet al.. Short Protocols in Molecular 

&}?}9gy (4* ed., John l^dley and; Sons, Inc., ,1S>S>9). .. . x 
us .'iriy.iie :.<> - : ^^.f^Sf^m qtejj&.wege perfonned, using PCR-based protocol that 
«.-:> was .modified to, generate, fr^gmente ; larger, .than would -be derived by, the Clontech 
, ; , methodology. By this, modified pro^ocpl^ithe. tester and driver. c^NAs ^ere : separately 
digested with five restriction endonucleases. (Mlu I, Msc I, PvuJL Sal I and Stu I) each 
20 of which recognjze a unique oVbase pair nucleotide sequence. This digestion resulted in 
an average cpNA size of 600, bg, rather , than the average size pf 300 bp that results 
from digestion with Rsa I according to the Clontech methodology. This modification 
did not affect the ultimate subtraction efficiency. ; 

:•-> : Following the restriction digestion, ; adapter oligonucleotides having 
25 unique, nucleotide sequences were ligated onto the 5' ends pf.the. tester cDNAs; adapter 
oligonucleotides were not ligated onto the .driver cDNAs,.. The- tester and, driver cDNAs 
; .were subsequently hybridized one, to the. .ojher. using an excess of driver cDNA. This 
hybridization step resulted in populations of (a) unhybridized tester, cTJN As, (b) tester 
cDNAs hybridized to other tester cDNAs, (c) tester cDNAs hybridized to driver 
30 cDNAs, (d) unhybridized driver cDNAs and (e) driver cDNAs hybridized to driver 
cDNAs. 
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Tester cDNAs hybridized to other tester cDNAs were selectively 
amplified by a polymerase chain reaction (PCR) employing primers complementary to 
the ligated adapters. Because only tester cDNAs were ligated to adapter sequences, 
neither unhybridized tester or driver cDNAs, tester cDNAs hybridized to driver cDNAs 
5 nor driver cDNAs hybridized to driver cDNAs were amplified, using adapter specific 
oligonucleotides. The PCR amplified tester cDNAs were cloned into the pCR2.1 
plasmid vector Onvitrogen; Carlsbad, CA) to create a libraries enriched in differentially 
expressed colon tumor antigen and colon metastatic tumor antigen specific cDNAs. 

Three thousand clones from the pCR2 r l tumor antigen cDNA libraries 
10 were randomly selected and used to obtain clones for microarray analysis (performed 
bV'Rosetta: Seattle, WA) and nucleotide sequencing. The cDNA insert from each 
pCR2.1 clone was PCR amplified as follows. Briefly, 0.5 ul of glycerol stock solution 
was 'added to 99.5 ul of PCR mix containing 80 'pi H20, 10 \x\\ OX PCR Buffer, 6 ul 
MgW 1 10 mM dNTPs, 1 |xl 100 mM Ml 3 forward primer 

15 (CACGACGTTGTAAAACGACGG),^ 1 ul 100 mM M13 reverse primer 
(CACAGGAAACAGCTATGACC), and 0.5 ]il 5 u/ml Taq DNA polymerase. The 
Mi 3 forward and reverse primers used herein were obtained from Operon Technologies 
f Alameda, CA). The PCR amplification was performed for thirty cycles under the 
following conditions: 95°C for 5 minutes, 92°C for 30 seconds 57°C for 40 seconds, 
20 75°C for 2 minutes and 75°C for 5 minutes. , , 

mRNA expression levels for representative clones were determined 
using microarray technology in colon tumor tissues (n=25), normal colon tissues (n=6), 
kidney, lung, liver, brain, heart, esophagus, small intestine, stomach, pancreas, adrenal 
gland, salivary gland, resting PBMC, activated PBMC, bone marrow, dendritic cells, 
25 spinal cord, blood vessels, skeletal muscle, skin, breast and fetal tissues. An exemplary 
methodology for performing the microarray analysis is described in Schena et al, 
Science 270:467-470. The number of tissue samples tested in .each case was one (n=l), 
except where specifically noted above; additionally, all the above-mentioned tissues 

were derived from humans. 
30 The PCR amplification products were dotted onto slides in an array 

format, with each product occupying a unique location in the array. mRNA was 
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extracted from the tissue sample to be tested, and fluorescent-labeled cDNA probes 
were generated by reverse transcription, according to standard methodology, in the 
presence of fluorescent nucleotides vj/5 and i|/3. .See, e!g., Ausubel, et al., supra for 
exemplary reaction conditions for performing the reveree transcription reaction- vi/5 and 
\j/3 fluorescent labeled nucleotides may be obtained, e.g., from Amersham Pharmacia 
(Uppsala, Sweden) or NEN® Life Science Products, ''fac^''(PostoB,''MA). The 
microarrays were probed with the fluorescent-labeled cDNAs, the slides were scanned 
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and fluorescence intensity was measured Genetic MicroSystems instrumentation for 
preparing the cDNA microarrays and for measuring fluorescence intensity is available 
from Aflymetnx (Santa Clara, CA). 

- . A* 1 elevated fluorescence intensity in a micro^ay sector probed with 

cDNA probes obtained from a colon tumor or colon metastatic tumor tissue as 
compared to the fluorescence intensity in the same sector probed with cDNA probes 
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obtained from a normal tissue indicates a tumor antigen gene mat is differentialVv 

■■• . - 1 : - ,•■ v.-- 1 '• . "• f- . r. K , . J 

expressed in colon tumor or colon metastatic tumor tissue ' 

C1 ° n f s disclosed herein as SEQ ID NOs: 1-234 and described in fables 
2-4 were identified from the PCR subtracted differential colon tumor and colon 



, metastatic tumor cDNA libraries by ^h'mcti>ar^"basbd m^twdSlogy. Of these 234 
clones, those corresponding to SEQID NOs: 1, 6, 18-20, 27, 3d,' 37, 40^ 57,65, 81, 82, 
20 86,88,91,95,96, 106, 107, 117, 121, 123^26, 130, 148, 150: 152, 155, 157, 159, 161, 
: 174, 175, 180, 182, 187, 190, 191, i& 203, 204 and 209 showed no significant 



similarity to known sequences in Genbank. 
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EXAMPLE 2 

C907P IS OVEREXPRESSED IN COLON TUMORS 

Using the C907P cDNA sequence, which was discovered from the 
subtracted cDNA library and cDNA microarray discussed above, the Genbatik database 
5 was searched. C907P matches with a known gene named Epiregulin (Genbank 
accession number D30783). Two gene-specific primers were synthesized, and^used for 
PCR amplification to clone this gene from colon cDNAs. The amplified PGR; product 
was sequenced to confirm its identity. Thus, C907P-Epiregulin is a gene up-regtilated 
in colon cancer. PCR was performed under conditions of denaturing cDNA at 94°C for 

10 1 minute, then 35 cycles of 94°C for 30 seconds, 60°C for 30 seconds, -72 6 C for 2 
minutes. Proof-reading polymerase was used for the amplification. The cDNA 
templates used for the PCR were synthesized from colon tumor mRNA. The i amplified 
products were cloned into the TA cloning vector and the sequences were defexftiihed. 
The C907P DNA sequence is shown in SEQ ID NO:234, and the amino acid sequence 

15 is shown in SEQ ID NO :23 5. \. 



EXAMPLE 3 » ■ ;i 

Full Length PCR Amplification and cDNA cloning of the C915P Colon Tumor 

Antigen 

20 The C915P cDNA sequence (SEQ ID NO:33; also referred to by clone 

identifier number 54160), discovered from the subtracted cDNA library and cDNA 

microarray discussed in Example 1, was used to search the Genbank rdatabasfe. C915P 

? - ■ 

, .. .m. , . 

was found to have some degree of similarity to a known '": gene named 
superoxidegenerating oxidase Moxl (Genbank accession number AF 127763). two 

i ' ' ■ * 

25 gene-specific primers were designed according to the sequence deposited in Genbank in 
order to amplify the full-length cDNA. PCR was performed under conditions of 
denaturing cDNA at 94°C for 1 minute, then 35 cycles of 94°C fof 30 second; 60°C for 

t *- * " i 

t *■ 

30 second, 72°C for 2 minutes. Proofreading polymerase was used for the 
amplification. The cDNA templates used for the PCR were synthesized from colon 
30 tumor mRNA. The amplified products were cloned into the TA cloning vector 
(Invitrogen, Carlsbad, CA) and random clones sequenced by automatic DNA 
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sequencing to confirm identity. The full-length cDNA and amino acid sequence of 
C915P is set forth in SEQ ID NO:244 and 245, respectively. 

Expression levels of C915P cDNA were further analyzed by real-time 
PCR. Using this analysis, C9 15P was confirmed to be overexpressed in colon tumors 
5 as compared to a panel of normal tissues. Moderate levels of expression were observed 
in normal colon tissues. Real-time PCR {see Gibson et al., Genome Research 6:995- 
1001, 1996; Heid et al., Genome Research tf:986-994, 1996) is a technique that 
evaluates the level of PCR product accumulation during amplification. This technique 
permits quantitative evaluation of mRNA levels in multiple samples. Briefly, mRNA 
10 was extracted from colon tumor and normal tissue and cDNA was prepared using 
standard techniques. Real-time PCR was performed using a Perkin Elmer/Applied 
Biosystems (Foster City, CA) 7700 Prism instrument. Matching primers and a 
fluorescent probe were designed for C915P using the primer express program provided 
by Perkin Elmer/Applied Biosystems (Foster City, CA). Optimal concentrations of 
15 primers and probe were initially determined and control (e.g., p-actin) primers and 
probe were obtained commercially. To quantitate the amoxmt of specific RNA in a 
sample, a standard curve was generated using a plasmid containing the C915P cDNA. 
Standard curves were generated using the Ct values determined in the real-time PCR, 
which are related to the initial cDNA concentration used in the assay. Standard 
20 dilutions ranging from lb-lb** copies of the C915P were generally sufficient. In 
addition, a standard curve was generated for the control sequence. This permitted 

standardization of initial RNA content of the tissue samples to the amount of control for 

■ o . : - ■■' < -.-1 , (•:. ' ■ : i': ' < ; ■ 

comparison purposes. 

25 EXAMPLE 4 

Production of Ra 1 2-C9 1 5P-F3 Recombinant Protein in E. Coli 

C915P (also referred to as clone identifier 54160, and set forth in SEQ 
ID NOs:33 and 244 (cDNA), and 245 (amino acid)) has 6 transmembrane domains 
(TMs) with 3 extracellular loops (EDI, ED2, and ED3). The deletion recombinant 
30 protein, Ral2-C915P-G (set forth in SEQ ID NOs:236 (cDNA) and 237 (amino acid)), 
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is an N-terminal Ral2 fusion of recombinant, modified C915P in pCRXl vector (EcoR 
LXhoI). 



«, t i 



Cloning Strategy for Ral2-C9l5P-f3: 

> o ■ f ■ :. ., . — ; — — ,. 

Three sets of primers were designed that were used sequentially to delete 
two internal transmembrane domains and amplify a recombined internal region of 
C915P that was put with EcoRI and Xhol and ligated in frame with Ral2 in the pCRXl 

vector. 



* i. L.J 



10 



• ,* .. PC»#1 used primers AW1 57 and AW156(SEQ 'ro NO&'l and 240 
respectively) to amplify the entire construct, deleting TM4 - ID3 - TM5. The PCR 
product jc915P(imnuVl^ID3^M5) PCR Blunt II TOPO backbone) was purified 
from agarose gel, ligated by T4 DNA Ligase and transformed into NovaBlue E coli 

■. * ' i.' . j . -: ; , i ' ^ 'j • ; ^ \ \ i ? < - t ■ ■ ■ i' ' . .. <■ ) 

, cells with the following standard protocol: the competent £ coli cells ^ were 'thawed on 

,, ice, DNA (or ligation mixture) 'was added, the reaction mixed and' incubated" on ice for 5 

■ I • n ? nutes - The £ co/i cells were heat-shocked at 42°C for 30 seconds, and left on ice for 

' » c, - 2 niinutes - Enriched growth media was added to the 1 coli md they were grown'at 
1 J ^ , 3 !-? for 1 hour - The culture was "plated on LB (plus appropriate antibiotics) and grown 
.,| overnight at 37 C. The next day, several colonies were randomly selectetl for miniprep 
(Promega, Madison, WI) and were confirmed by DNA sequencing for correctly deleted 
region. This step was then repeated on a second region of C915P as described below: 
20 „ PCR#2 used primers AW155 and AWl54 (SEQ ID r^ds:239 and 238 

respectively) to delete TM2, using a confirmed clone from PCR^l as template. The 
PCR product (C9 1 5P(minusTM2 / TM4-ID3-TM5) PCR Blunt II TOPO backbone) was 
purified, ligated and transformed using standard protocols into NovaBlue celis, yielding 
clones that were confirmed by sequencing for the correct deletion. 
25 PCR#3 used primers AW158 and AW159 (SEQ ID NOs:242 and 243, 

respectively) to amplify the deleted, recombined three-part fusion protein of C915P 
EDI - ID2-TM3-ED2 - ED3, using the confirmed' PCR#2 clone as template PCR 
product from PCR#3 was purified and digested using EcoR I and Xno I for ligation into 
the pCRXl vector (EcoR I,'"Xhd I). The ligation mixture was transformed into 
30 NovaBlue cells by standard protocols, and several' clones were selected for miniprep 
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and sequencing. UI#70526 was , confirmed by DNA sequencing, to be. the correct 
pCRXl Ral2-C91 5P-f3 construct. . . . , , 



J . ■ '-1 



Cloning Primers: 

5 C9l5P-AWi54 (SEQ IDNO:238): antisense cloning primer to delete 

TM2, 5 'P— Primer Id9682: 5' P- 

^ ' .r ' P, ' " , * A ' •. P 

C915P-AWi55 (SEQ ID NO:239): sense cloning primer to delete TM2, 
5'P— Id 9683: 5' P-TGTCGCAATCTGCTGTCCTTCC. 

' C915P-AWi 56 (SEQ ID' >JO:240): antisense cloning primer to delete 

10 1M4-TM5 region, 5>-P, -^mexid^^S-P- GCTGGTGAATGTCACATACTCC. 

C915P-AWT57 (SEQ ID NO:24i): sense olonmg primer to^elete' TM4- 
TM5 region; : 5'-P - Id 9685: 5' ; P- CGGGGTCAAACAGAGGAGAG. • ' 

the fusion protein with EcoR I site Primer Id • 9686: 5' 
15 gtcgaattcGATGCCTTCCTGAAATATGAGAAG. 

Ral2-C915P-P3-AW159 (SEQ ID NO:243): antisense cloning primer 
:■ for the fusion protein with stop and Xho K site Primer Id 9687: 5' 

io cacctcgaj 




1 1 ■ Protein Mforrnation for Ral2-C915P-f3: 

20 S - i 1 MbiecuW Weight 32429.45 Daltons 

-.. •. ' ■ .■ . , . ■■. ■'• ;i 

'297 Amino Acids 

' : : 28 Strongly Basic(+) Amino Acids (K,R) 

; ' 27 Strongly Acidic(-) Amino Acids (D,E) 

i 

! ' 93 Hydrophobic Amino Acids (A ,I,L,F,W/V) 

2 5 ; 86 Polar Amino Acids (N,C,Q,S;T,Y) 

' f 7.776 Isolectric Point 

! - ' 3.711 Charge at PH 7:0 



;S Protein Expression; 

- Mini expression screens were performed to determine the optimal 

30 induction conditions for Ral2-C915P-f3 . The best E. tofrk^cMxe conditions 
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were screened by transforming the expression construct into different hosts, then 
varying temperature, culture media and/or IPTG concentration after the inducer IPTG 
was added to the mid-log phase culture. The recombinant protein expression was then 
analyzed by SDS-PAGE and/or Western blot. & coli expression hosts BLR (DE3) and 
5 HMS (DE3) (Novagen, Madison, WI) were tested in various culture conditions, with 
little foil-length Ral2-C915P-f3 expression detected and. Western blots showing some 
bands at unexpected nqtolecular weights. Tunei; (DE3) . cells (Novagen, Madison, WI) 
were then tested with helper plasmids at various IPTG concentrations. Coomassie 
staiped SpS-PAGE showed no induced band but Western blot confirmed -a strong 
10 „ i ^ 2 "C9J5P-B .signa] a^t: 32kjp probing with an. an^i-o^chis lag antibody. The most 
optimal expression; for pC&Xl Ral2-9?15P-f3 was found to be, m the host strain Tuner 
r.i ( D E3),;with a helper plasmid, grown in Soy Terrific Broth .niedia at 37°C induced with 
1 .0. biM IPTG at 37*0 for 3hr. . ■? . , 

• - .'. < J. „ i > _ ■ . \ ; ■ t ■ i 

J<5. ..f:.., . •. • ,. ,;; • ./. - ; ,-„„>; EXAMPLE 5 -t v" f ,( 

V So v Purification qfhRa 1 2-C9J 5P-f3 Recombinant Fusion Protein from E. Coli 

* 

The clone C9I5P^&. , ftiJfod.t&^ of colon 

cancer tissues. For expression in E. coli, the construct Ral2-C915P-f3 (SEQ ID 
NO:236) was made as described in Example This construct encodes a fusion protein 

20 consisting of an Nrtepnjnal 6x histidine tag followed by Ral2 and modified C915P 
(excluding 5 of 6 transmembrane domains) (SEQ ID NO:237). The 32.4kD protein was 
expressed in multiple large baffled shaker flasks containing 1L of Soy Terrific Broth 
media, j The cultures were spun and cejl. pellets washed, respun and frozen for 
purification. After cell lysis, the recombinant protein was found in the insoluble 

25 inclusion body fraction. The inclusion body was thoroughly washed with buffered 
detergents multiple times, then the protein pellet was denatured, reduced and 
solubilized in buffered 8M Urea and Ral2-C915P-f3 protein was bound to a Ni-NTA 
affinity chromatography matrix. The matrix was washed to rinse away contaminating 
E. coli proteins and Ral2-C915P-f3 was subsequently eluted using high Imidazole 

30 concentration. . The. , fractions containing Ral2-C915P-f3 were pooled and slowly 
dialyzed to allow for renaturation of the protein. The purified Ral2-C915P-f3 was then 
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filtered and quantified. SDS-PAGE analysis showed the elution pattern off the nickel 
column with the major band running at the expected weight of about 32kD. This was 
further confirmed by western blot' using an anti-6x His tag antibody. The western blot 
also revealed evidence of dimers and tetramers of the recombinant. N-terminal 
5 ' sequencing confirmed purity of about 90%. Purified yield was about 2.5 mg/L 

■ T 

induction. 

x 1 FolloWing is d detailed protocol of the production of purified Ral2- 

C915P-f3. ( 
' : For me frozen bacterial cell pellet : ^ 
10 1. Thaw bacterid ceU peUet from 1L induction on ice 

v ' ; r '2. i: Add^Sfal'sonica^dn' tosffer^OmM Tris, 500mM NaCl) per liter 
' - of induction culture ; ; " 
■ ; " : 3f 11 Add 1 Complete protease inhibitor tablet and 2mM PMSF 

(Phenylmethylsulforiyl fluoride) to sonication buffer/pellet mix 
15 4. Completely reSuspend pellet with pipet ! 

' ' ' 5. ! A!id-'d!ihi^^ f l^s6^^%ti^e l t^ / &om lyophilized lysozyme 

stored at —20 C) 

! " ;: ' r " : lh " '■ Decant into'a giass beaker + strPbar, gently stir at 4°C, 30 min 

7. ' French Press 2 x 1 lOOpsi, keep on ice 
20 1 8; :> ' Once lysis 'solution** has low viscosity, spin at 11000RPM, 

30min, 4°C 
9. Save supernatant** and pellet 

For the pellet from step 9 above: 
25 1. Wash pellet with 25ml 0.5% . CHAPS (3-([3- 

Cholamid6prbpyl]dimemylammonio)-l-propanesulfonate) wash 

' ' (20mM Tris (8.0), 500iru\l NaCl) ** by sonicating 2xl5sec 

' ' ". r ' ; '' : ] @15Watt ''• ■ '' -' J ' "' ; '" : ' 

; '' '• ; 2. ; Spiri atli000RPM for 25min^ 
30 3. Repeat above steps 3 tittles with' 0.5% DOC (Deoxycholic Acid) 

' ' " wash (20mM Tris (8.0), 500mM NaCl) - : 
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- {4. ,. Resuspend pellet in t pellet . binding buffer (20mM Tris (8.0), 
.< . . ••.» .... 500mM . NaCl, 8M Urea, 20mM Imidazole, lOmM 0- 

. Mercaptoethanol) with sonication ... . 
■ . 5. Equilibrate Ni ++ NTA (l^itrilotriacetic acid), resin (Qiagen, 

5 1 , - . Valencia, CA) with pellet binding, buffer, spin down and decant 

wash (use 4ml resin) 

6. ; , Add resin to resiispended pellet,^ stir at room temperature for 

45min 

7. Prepare colprpA and byff^rs, .rin^e, column with pellet binding 

8. ■ fj Pour pellet/Ni ^8^.^0,0.0^111^ collect flow through (FT)** 

9 . Wash column with ^ 0ml pellet binding buffer * * 
1Q, ; . ^Vash column ^dt,h 30ml pellet binding buffer with 0.5% DOC 

. .(Depxycholiq Acid)** 
1 5 11, Wash column v$th 30ml pellet binding buffer 

■o^vi ' I^ 1 , ^ fraction of pellet binding buffer #1 (binding 

buffer +300mM Imi4,azQ}&)**. . 
,13. . Elute with 2. x 5ml fractions of pellet elution buffer #2 (binding 
buffer +300inM Jniidazqle ! , pH 4.5)** . 
20 14. Run SDS-PAGE to screen purification steps (westerrt and 

coomassie stain) 



**Save an aliquot at 4°C for each purification step to check on SDS- 
PAGE. 



1 j . * 



EXAMPLE 6 , 

r f ■ ...REAL-TIME PGR ANALYSIS OF COLON .TUMOR CANDIDATE GENES 

The first-strand cDNA to be used in the quantitative real-time PCR was 

; it' *, • 

synthesized from 2Qng, of total RNA that had been treated with DNase I (Amplification 
30 Grade, .Gibco Technology, Gaitherburg, MD), using Superscript Reverse 

Transcriptase (Rp (Gibco BRL .Life Technology, Gaitherburg, MD). Real-time PCR 
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was performed with a GetieAmp™ 5700 sequence detection system (PE Biosystems, 
' Foster City, CA). The 5700 system uses SYBR™ green, a fluorescent ^dye that only 
intercalates into double stranded DNA, and a set of gene-specific' forward and reverse 
primers. The increase in fluorescence is monitored during the whole amplification 
5 process. The optimal concentration "of primers was determined using a checkerboard 
approach and a pool of cDNAs from breast tumors was used in this process. The PCR 
: reaction was performed in 25fd volumes that include 2.5ul of SYBR green buffer, 2ul 
of cDNA template and 2.5fU each of ; me forward 1 and reverse primers for the gene of 
interest: The cDNAs used for RT 'reactioris were diluted 1 HO for each gene of interest 
10 arid 1:100 -for the P-actin control. • In order tb quafltif ate the amount of specific cDNA 
1 ^and' hSi/ce initial mRNA) in the' sample, a standard-curve is g^erierated for each run 
tfsing the 5 plashlid DNA containing the gene of iritbrestJ : Standard curves were generated 
f - ■ uMng me Ct values r determined in me real-time PCR which were related to the initial 
cDNA concentration used in the assay. Standard dilution ranging from 20^2x10 copies 
15 of the gene of ; interest was used for thJs purpose. * In addition, a standard curve was 
'•' generated for p-actiri ranging from 200fg-206ofg; ; This enabled 'standardization of the 
• -^liriitial -R^A "content of a ft&ue -^pl^^'-'^'t^D^f %¥' : p^iEKrthi for comparison 
" n purposes: The mean'copy number for each group of tissues tested was normalized to a 
constant lamount of p-actin, allowing thb evaluation' of the over-expression levels seen 

20 with each of the genes. ^ - t{ ' ' " ;? 

' Colon tumor candidate genes, C906P (SEQ ID NO:5), C907P (SEQ ID 
NO:234 (cDNA) arid 235 ! (amino acid)), C911P (SEQ ID NO:21),' C915P (SEQ ID 
NO:244 (cDNA) and 245 (amino acid)), C943P (SEQ ID NO: 140), and C961P (SEQ 
ID NO:200), were analyzed by real-time PCR, as described above, using the short and 

25 extended colon panel. These genes were found to have increased mRNA expression in 
30-50% of colon tumors. For* C9&6P; slightly elevated expression was also observed in 
nbrfrial trachea,- heart, arid normal colonV For C907P, elevated expression was also 
" observed in activated PBlvlC arid slightly elevated expression in heart and normal 
colon. For' C91 IP, slightly elevated expression was observed in pancreas. For C915P, 

30 no expression was ' observed in norrrial tissues 'except normal colon. For C943P, no 
expression v/ais observed in normal tissues' except normal cblonl For C961P, some 
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expression was observed in trachea and normal colon. Collectively, the data indicate 
that these colon tumor candidate genes could be potential targets for immunotherapy 
and canper.diagnpsis. , \, ; A * < » / ,.. 

, ;'" ' ■ 1 > i . ' ' ■ " . ''•* -■ • ■ * t •' '»*••;-' - - t 

S\ - ;-J ... ; , EXAMPLE 7 t ;-, . , . . 

i -, " -i n f : . Peptide Priminp^Qf TtPeijpep. Lin^s : L } ... 

; f i; l; Generation of ,CD4^ T hejiper, lines an 

: ; . derived frpm tumorrspeqifiq aptigen^th^t ^re cqp^ble qf being recognized by CD4 + T 

t : . cells in the context of JJLA clas? II ^plgculpsj is.pamed^out;^ follows: r ., ■ 

^ ; Y T 10 . ; l- f : : , -Fift^r^fr.peptide^ overlapping t by ,10 ^nupo T apic}sj ^eqvpd |pm a 

; , r y ; j ti^pyrspeci^ I^^^ntic ; cells (DC) 

:i are derived from > PBMG, of ; 3 normal donor using GJy^CSF and IL^4 by standard 

* . ■» ' .... 

■ , protqcols.^ £^ 

: , ; , n bead3 (Miltepji Biotec^ufcumj CA) andjQfig^y^ selector*, PC are pulsed overnight 
.^. v . -15/: with pools of the 15-mer peptides, ^ith each peptide at a, final concentration of 0.25 
^th to r,<;- li^yti- y are wash^d ( an^ { plated at ; V x 10 4 cells/v^ell of, 96-well V-bottom 

; j , plaftss ^nd pi^fied^G^^ \ Orwell t > . ^ultur^s \ ai$ supplemented 

, with 60, ng/ml IL-6 and 10 ng/rpil JL-12 an^ ^ubated, at 37°^, Cultures are 

.xestimulated as above on a weekly basis 7 using : DC ( generated, and jpvilsed as above as 
20 antigen presenting cells, supplemented with 5 ng/ml IL-7 andlQ U/ml ILt2. Following 
4 in yitro stimulation cycles, resulting CD4 + T cell lipe^ (e^cji line corresponding to one 
M well) are tested for specific proliferation and cytokine production in response to the 
i stimulating poo|s of peptide with an prelevaqt pool of p^ptide$ ysed as a control. 

i« - ^ ■ i- i EXAMPLE, 8 , iV , , ^ i ,*-. 

; - : . GeNERAJipN OF ^MGR-SfECIflC LlN^S lISllsfG jKyiTRO^WHpLE-GENE.PRIMING 

; j; ; , . , j , Using in vitro. Ayhole-g^ne priming wi^h ^tumpp antigen-^vaccinia infected 

V) :pC. l (see ? ..fpr.,ex^pk r ,Yee et a],, me Journal, of Immunology, 157(9):4079-86, 1996), 
30 , hurn l an /i PT]L lipe? ^arp derived that specifically jcecpgnize autologous fibroblasts 
. . transduced with .a . specific tumoi; antigen, a$ detemiined -by interferon-y ELISPOT 
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analysis. Specifically, dendritic cells (DC) are differentiated from monocyte cultures 
derived from PBMC of normal human donors by growing for five days in RPMI 
medium containing 10% lium&i serum, 50 ng/ml human GM-CSF and 30 ng/ml human 
IL-4. Following culture, DC ar^ infected overnight with tumor antigdi-recombinant 
5 vaccinia virus at a multiplicity of infection (M.O.I) <*f : fi Ve > 3X1(1 matured overnight by 
the addition oJT3 }xg/ml CD40 ligand; Virus is then inactivated by UV irradiation. 
CD8+ T cells are isolated using a magnetic bead system, and priming cultures are 
initiated using standard culture techniques. Cultures are restimulated every 7-10 days 
using autologous primary fibroblasts retrovirally transduced with previously identified 
10 tumor antigens. Following four stimulation cycles, CD8+ T cell lines are identified that 
specifically produce interferon-y when stimulated with tumor antigen-transduced 
autologous fibroblasts. Using a panel of HLA-mismatched B-LCL lines transduced 
with a vector expressing a tumor antigen, and measuring interferon-y production by the 
CTL lines in an ELISPOT assay, the HLA restriction of the CTL lines is determined. 

15 

EXAMPLE 9 ' 
Generation and Characterization of anti-Tumor Antigen monoclonal 

antibodies 

20 Mouse monoclonal antibodies are raised against E. coli derived tumor 

antigen proteins as follows: Mice are immunized with Complete Freund's Adjuvant 
(CFA) containing 50 jig recombinant tumor protein, followed by a subsequent 
intraperitoneal boost with Incomplete Freund's Adjuvant (IF A) containing lO^ig 
recombinant protein. Three days prior to removal of the spleens, the mice are 

25 immunized intravenously with approximately 50 jag of soluble recombinant protein. 
The spleen of a mouse with a positive titer to the tumor antigen is removed, and a 
single-cell suspension made and used for fusion to SP2/0 myeloma cells to generate B 
cell hybridomas. The supernatants from the hybrid clones are tested by ELISA for 
specificity to recombinant tumor protein, and epitope mapped using peptides that 

30 spanned the entire tumor protein sequence. The mAbs are also tested by flow 
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cytometry } for their ability to detect tumor protein oi} the surface of cells stably 

t ■ ■ ■ .. 

. transfept?d ; with ^he cDN^encqding the tumor protein. , t . f 

From the , foregping it will . be appreciated that,, although specific 
embodiments of the inventipn have been described, herein for purposes of illustration, 
5 V various modifications may be made without deyiating from .the spirit and scope of the 
. , -. invention. Accordingly, the invention i^ not limited ?xpept as by the appended claims. 
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CLAIMS 

What is Claimed: ' 

1. An isolated polynucleotide comprising a sequence selected from 

the group consisting of: 

(a) sequences provided in SEQ ID NO: 1-234, 236, and 244; 

(b) complements of the sequences provided in SEQ ID NO: 1-234, 

236, and 244; r ' 

(c) f sequences 66iisistihg > of at' least 20 contiguous residues of a 

; sequence provided in SEQ ID NO: 1-23*4, 236^ Shd 244; 

(d) sequences that hybridize to a sequence prbvided in SEQ ID 
J NO: 1-234, 236, and 244, under moderately stringent conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 

NO:l-234, 236, and 244; ; 5 r 

i (f) • ' sequences having at least 90% ; identity to a sequence of SEQ ID 

NO: 1-234, 236, and 244; and 

1 ' (g) degenerate! Variants of a sequence provided in SEQ ID NO: 1-234, 

236, and 244. 

2. An isolated polypeptide comprising an amino acid sequence 

selected from the group consisting of: 1 1 

(a) sequences encoded by a polynucleotide of claim 1 ; 

(b) amino acid sequences set forth in SEQ ID NO:235, 237, and 245; 

(c) sequences having at least 70% identity to a sequence encoded by 

a polynucleotide of claim l;and 

(d) sequences having at least 90% identity to a sequence encoded by 

a polynucleotide of clkim 1 . r r - ! ! 

3. An expression vector comprising a polynucleotide of claim 1 
opef ably liMed to an expression contr61 sequence. ' 1 " ' * 
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4. A host cell transformed or transfected with an expression vector 
according to claim 3. 



. 8 5. An isolated antibody,; pr antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 2. 



6. A method for detecting, the r presence of a cancer in a patient, 
comprising the steps of: 

[ ; ; (a)» v obtaining a, biplpgical sample ,^pm the. patient; 

(b) contacfipg the biological ^ampl? with, a bidding agent that binds 
to a pplypepti<jp of claiip 2; 

: (°) - detecting jjji. the : S£^]je,.Wvpn^imt^f polypeptide, t^at binds to 
t^e bii^ding^agent; and. >^ )t 

(d) comparing the amount of polypeptide to a predetermined cut-off 
O .[ O i : ^value and therefrom determining the presence pf a cancer in the patient. 



1 ■ T M _ _ 

V -I . . ' l 



! / v ' 7tm ; q .^fosi$^^ o^e polypeptide according to 

claim 2. 



> 8. l An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NO: 1 -234, 236, and 244 under moderatejy stringent conditions. , 



9. Argi^hod for stirtii^lating and/or expanding T cells specific for a 
,tumor protein, comprising contacting T celljs with.at least ope component selected from 
the group consisting of: . r 

(a) polypeptides according to claim r 2; , 

(b) polynucleotides according to claim 1; and ..... t r 

(c) antigen-presenting cells that express a polypeptide according to 

under conditions and for a time sufficient to permit the stimulation 
and/or expansion of T cells. 
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: 10. An isolated T cell population, comprising T cells prepared 

according to the method of claim 9. 



11 A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and immunostimulants, and a 
second component selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim I; 

(c) antibodies according to claim 5 ; 

(d) fusion proteins according to claim 7; 

(e) T cell populations according to claim 10; and 

: (f) antigen presenting celis that express a polypeptide according to 



claim 2. 



1 VI A method for stimulating an immune response in a patient, 

comprising administering to the patient a composition of claim 1 1 . 

r t 

13. A method for the treatment of a cancer in a patient, comprising 
administering to the patient a composition of claim 1 1 . 

14. A method for determining the presence of a cancer in a patient, 

comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 

according to claim 8; 

(c) detecting in the sample an amount of a polynucleotide that 

hybridizes to the oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient. 
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15. A diagnostic kit comprising at least one oligonucleotide 



according to claim 8. 



16. A diagnostic kit comprising at least one antibody according to 
claim 5 and a detection reagent, wherein the detection reagent comprises a reporter 
group. 



17. A method for inhibiting the development of a cancer in a patient, 



.i ■■ i 



comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 



.O.i\ = * v 



according to claim 2; (ii) polynucleotides according to claim 1; and (iii) antigen 
presenting cells that express a polypeptide of claim' 2, such that T cell proliferate; * 

(b) administering to the patient an effective amount of the 
proliferated T cells, and thereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 



<110> Corixa Corporation 
Jiang, Yuqiu 
Hepler, William T. 
Clapper, Jonathan 
Wang, Aijun 
Secrist, Heather 



<120> COMPOSITIONS AND; METHODS^ FOR. THE THERAPY . 
; AND .DIAGNOSIS OF COLON CANCER ; . .. 



I ■ o J 



<13Q> 210121. 524PC 



<140> PCT 

<141> 2001-06-08 



<160> 245 



<170> FastSEQ for Windows Version 4,0 



<210> 
<:211> 
<212> 
<213> 



.1 

334 

DNA. 

Homo 



).:<'■ 



sapien 



=>..) i-. ... 



<400> 1 '■-< ' ■;.r'-.".. : s '...:„... ::m r ■: -< " ~> i-<- . •* '- ■- 

actcaatatt ataaaaacct caaataattg acttgatttfe acacaaca^c qt.tpcctttt 

ctacaagtta atttttttac aaatcatttg ggttatctc^r.it^a^feaggfet, atatt;ttatt 

gcttctagaa acaatgtttc aaaatatatg tgcattatca gtaataattt gtataaatat 

ttcccacaac aattttcata attttcaaag actaatttct tgactgaaga tattttgcta 

gggaagtgaa actttaaaat tttgtagatt ttaaaaaata ttgttgaatg gtgtcatgca 

aaggatttat atagtgtgct cccactaact gtgt 



<210> 2 
<211> 650 
<212> DNA 
<213> Homo nsapien 



-T - > 



<220> 
<221> mis cofeature 
<222> <1>. ..(650) 
<223> n ~ A, T, 0 or G 



.." J 



<400> 2 

actttgttat ttttccatca ctaaaggcca atcagaattt ggaaccatgc tgctacccaa 
gaaatctaat ggaatgaatt agttctgtag atgacaattt cttcacccat ttatgagacc 
taaatctttt ccataacact catgtattca gtataacaac atactaactg aaagagggac 
ctgattgttt aaagtttgat tgcagacgct gtagaacata actcattatg tttcagataa 
ggtaactcct agatatcaaa ctaatttgtt ggggtagaga ttttacaagt catgccatta 
gaagattttc tctgatatta tatgtgcagt tcagttacaa gatgaaatca tgttttttta 
acaaaagaga taaaatacaa .ttgaagcaaa aaataaca'gc tagtatataa tatatacagt 
ctgtatttgc ttttcacagt aggcctgatgr actaaaagat • atgc.t-t.tatt acacgctatt 
ttcacctctt -gaaagtcaaa ggtgatgatt .aatttcattt agcagggaag tggaataata 



,.6-0 
120 
18 0 
24 0 
30.0 
334 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
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» 2 



tcttttgaaa taactaagtc cactaaatta tcagtatgct attctggggt ctaagtacct 
gnccggcggn cgctcaaang gcgaattctg cagatatnca tcaccttggc 



<210> 
<211> 
<212> 
<213> 



3 

444 
DNA 
Homo 



sapxen 



600 
650 



<400> 3 

acacatccca 

aagtttgtgc 

gtaatatttc 

ctaaggctaa 

acagtttgat 

aacttggcaa 

tactccaatt 

ctgcccgggc 



tcttcaaatt 
tataaaattg 
gcttcctaaa 
ccaaacttag 
ttaaaaacaa 
acccttccct 
aaatgcacta 
gggcaagggc 



taaaatcata 
tgcaaatatg 
tttcttccac 
atataaa'tcc 
aacagaaaca 
aactgtccaa 
ctctttcact 
gaat 



<210> 
<211> 
<212> 
<213> 



4 

509 
DNA 
Homo 



ttgtcagttg 
ttaaggattg 
ctacagataa 
taccaataaa 
aatttcaaaa 
gtatgagcat 
ggtctgaatg 



tccaaagcag 

agacccacca 

tagacaacaa 

atttttcagt 

taaatcacat'. 

acactgccac 

aagtatggtg 



cttgaattta 
atgcactact 
gtctgagaaa 
bttaagtttt 
cttc'tcttaa. 
tggctttaga 
aaacaagtac 



60 
120 
180 
240 
300 
360 
420 
444 



sapien 



<400> 4 
aaaaacaaaa 
ctgccctttg 
tttgcttcct 
taatagatta 
acatgcaaaa 
atatatgaaa 
aacattgtca 
^agcttattg 
ataattca^a 



ttaaattttc 
gttaacagca 
taagttttca 
actaaactgc 
tccacataaa 
actgcatcat 
tatgtgaaaa 
taagWggttcf 
taaaatgcTefc; 



atttcaatta 
tcagcatcac 
acatatcatt 
actgtaattt 
ttgtcattta 
caattcaacg 
cgacacccta 
aacfe- 1 tt g*ta5 

> .'7 
• •(- 



agaccccttt 
attactattt 
tatatttaaa 
agataaaatt 
accaacagta 
ttttggtact 
agtccttctt 
■fet?t>t f gtaaict: 



tggcattttg 
tatattgcat 
ggcagacact 
actgtgtctc 
ctgcacgagc 
tgaaactgca 
tttaaaaatg 
>atctttaagc: 



c. ? o . ; j 



<210> 
<211> 
<212> 
<213> 



5 

478 
DNA 
Homo 



r. 



cttatttatt 60 

atatgtagca 120 

gagtcagtat j 180 

actgtgtatt ;240 

gaacatcteg 300 

tcataaatgc 360 

acattgcgtt ■ ,420 

•tctt.cagt.tt.. /48.G 

■- x ..r . .. -.t :. 509 

.'' '. ." l h -v. : i~ !;> L'J' " ; ' \, ^ V 

T; 'J" j .-. JF ;; z I j :.' ' 



. .j ■ ■ 



sapxen 



<400> 5 

acattgagta 

tctttctcca 

gtggagggaa 

tcatcattag 

agaagtcata 

aattatagaa 

cccgggcggc 

catctagagg 



gagcatcaag 
aaactccaaa 
gaagtccttc 
caggtatccc 
cgttaagtag 
atgtatagac 
caagggcgaa 
gcccaattcg 



agcaataaaa 
aagagctctt 
cagcaatgat 
ttctccctcc 
atctacaact 
acctcatact 
ttctgcagat 
ccctatagtg 



aagacttcaa 
cgaagggctc 
aagcatgtaa 
cttgtcagcc 
catttgatat 
caaataagaa 
atccatcaca 
agtcgtatta 



aaaaggttac 
ttatgacatc 
tgagtcgtct 
ttccttcctt 
gaagcgttac 
actgacttaa 
ctggcggccg 
caattcactg 



aagagcattc 

ccacg.gtctca 

ttctagcaca 

ctttgaaagg 

caaaatqtta 

atggtacctg.. 

ctcgagcatg 

gccgtcgt 



<210> 6 
<211> 485' : 
<212> DNA,,. 
<213> Homo "sapieri 



"v, 



<4 00> 6 : =. . 4 ; t - s c. 

aaatgtceaa ggtggcccca agggaggacfe tetgcagcac: agctcccttc ccaggacgtg 

aaaatctgec ttct caeca t gagge'ttcta gticctttcca^ gcctgctctg tatcctgett, 

ctctgcttct ccatcttctc cacagaagggp i-aarg^gg-cgtc^etgccaaggc ctggtcaggc 



60 
120 
180 
240 
300 
360 
420 
478 



: 60 
.12.0 
■180 
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aggagaacca ggctctgctg ccaccgagtc cctagcccca actcaacaaa cctgaaagga 
catcatgtga ggctctgtaa accatgcaag cttgagccag agccccgcct ttgggtggtg. 
cctggggcac tcccacaggt gtagcactcc caaagcaaga ctccagacag cggagaacct 
catgcctggc acctgaggta cctgcccggg cggccaaggg cgaattctgc agatatccat 
cacactggcg ggccgctcga gcatgcatct agagggccca attcgcccta tagtgagtcg 
tatta ........ , . - - ' 



<210> 
<211> 
<212> 
<213> 



7 . 
4 83 , 
DNA 
Homo 



240 
300 
360 
420 
480 
485 



'sapien 



<220> 

<221> misc_f eature 
<222> (1) . . . (483) 
<223> n — A, T, C or G 



<400> 7 

actgctggct gccccggctg gtcagtgggg caaagccggg catgaagaag tgcagccggg 

gaaacgggac catgttcaca gccagcttcc gcaggtcagc attgagctgg cctgggaagc 

gcaggcaggt ggtgacccca ctcatggtag cagacaccag gtggttcagg t caeca tagg 

tgggcgtggt cagctttagg gttctgaagc aaatgtcgta gagagcttcg ttatcaatgc 

agtaggtctc gtctgtgttt tctacgagct ggtggactga gagggtggcg ttgtagggct 

ccaccactgt gtctgacact ttgggcgaag gcaccacact aaacgtgttc atgatcctgt 

ctgggtacct. gpccggggcg tcgaaagggc gaattctgea gatatccatc acactggeng 

gccgctcgag catgeatcta gagggeccaa ttcgccctat agtgagtcgt attacaattc 
act - - • •.. - , ■ -. •• ; - • : ■■ ■■ \j ! -■ - ' 



60 
120 
180 
240 
300 
360 
420 
4 80 
4 83 



<210> 8 , . : • 

<211> 398 r iZ ' 

<212> DNA , ; > )' 

<213> Homo, .sapien 



J ... ■ . . 

' ■■ . ' -1,7 ' "= 



I re 



i i ■ 



.i s - 



<400> 8 

acaaggcaga tggagcattg acgttttcaa aaccattatt cctgtgactg gagaggcatc 
aggagagggc tegttegtet ccagctcata aaatgtagca gcatcatcct tgacagtgat , 
gtttttcagg ccctccattg agaacctgag gaaatctgta aagataagtg gtgatgttgt i 
ttcaaacgtt cagaacagat accatcatcc tgcctttgtt agctgctgta gggaaagtgc 
gttacagatg tctgctgacc tcacaagagt gaaaagataa actgtgcatg tgtttccact 
teegtttcta gtacctgccc gggcggcaag ggegaattet gcagatatcc atcacactgg 
gcgccgctcg. ageatgeate tagagggece aattcgee 



60 
120 
180 
240 
300 
360 
398 



<210> 
<211> 
<212> 
<213> 



9 

493 
DNA \ 
Homo 



sapien 



<400> 9 . ■ : >■ 

acagctttta tatctggagt agctatttag tgctccttct ctacctaagc aaggtttgac 
tgatagtcac tggagttttc ctgeagaact tggtcatatc cactcatact gctctgacca 
ccataaccac ctccataacc accactcagc tgctggctag caggacctcc ataactagac; 
tggttggata agcccatccc tcccatcatt tggctaccat aagcgccacc acttgcccct; 
gctgtagaat tcaaaaaaag ttctacatag ctgtgatcgt aagcaccccc acttgttcct 
gcagtagaat ttaagaagag ctccacatat ctgtgttgca tattagcttt gtcttttgee 
atagctgeca cagcatcttc atgagtagca aattcaacat ctgcctcacc ggtaactctg 
ccatcgggtc caatttcaat gtgtacctgc ccgggcggca agggegaatt ctgeagatat 
ccattacact ggc ?; . . ■ • : * r ' ; ' : ' J 



.. 60 
120 
180 
.240 
300 
360 
420 
480 
493 
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<210> 10 
<211> 392 
<212> DNA 
<213> Homo 



sapien 



<400> 10 ! 
acaaaacaca 
attattttat 
gtggattata 
caagtgtcaa 
ctttgtttcc 
agtagtgcat 
ttctggcaat 



accgaggagc 
attagtatta 
ttaagcaggt 
gatcagcaag 
ttttttagaa 
ttaattattt 
ttctacagaa 



gtatacagtt 
aaaatcaaac 
attagattta 
tgtccttaag 
agttctagaa 
tttaaaaaac 
acaagttgaa 



gaaaacattt 
cctatgtttc 
ggaaaacctt 
tcaaacaggt 
aataggaaaa 
tttttaagta 

gt 



ttgttttgat 
tttcagatga 
tccatttctt 
tttttttgtt 
cgaaaaattt 
cgctgtgaag 



tggaaggcag 
atcttccaaa 
aaagtattat 
gttgtttttg 
cattgagatg 
gcatcaacat 



<210> 
<211> 
<212> 
<213> 



11 
525 
DNA 
Homo 



60 
120 
180 
240 
300 
360 
392 



sapien 



<220> \ *■' . ' / 1 
<?221> mise^feWure / 
' <222> (1) . , i ^525) ?-v- 
<223> n A/T, C : or MS 



: ■■ - 



6,.. ;-»'■■»■ i.: 



/<4 00> 11 > 
accacaacac 
accacaacac 
! tcccgtagcc 
ggcctcanng 
ncacagtagc 
cgtnagtgaa 
ccctgacagn 
cangctnaac 
aatcaacaac 



1 ' 



caggcctcag 
tctcacctgc 
aaccaggttc 
aggaatctac 
cccagatcac 
gaatccacca 
accaccacnc 
ggatacaaca 
ttcccacagc 



tgaggcafcn 
dag'tiatgaca 
tactcactca 
caccgnctac 
cagccacaac 
cctcccacag 
cnggcctcan 
ctgttacctg 
agnccaggtt 



acdaccttcfe 

agcctaggcg 

acagtgtnac 

agcangcctg 

actctcacct 

ncgaccaggc 

tnggcattct 

ccaggaccac 

caactgacac 



acagcagccc 
tcggtgaa'ga 
ctgncagcac 
agtgagaaat 
gccancacga 
tcaacgcaca 
acaacttccc 
cacctcaggc 
agcac 



<210> 
<211> 
<212> 
<213> 



12 - 
4 98 ■ 
DNA 
Homo 



cagat caeca 
atccaccacc 
caccacgcca 
ntaccacttt 
caagctcagg 
caacagcatt 
acagcaannc 
ccilagtcagg ' 



<400> 12 * 

accacagcct 

cacacaacac 

gtccacagca 

ggacttcagg 

tcacctccta 

ggctcgactc 

tcaacaatat 

acaacctcag 

acaacgttac* 



sapien 



tatcctttgg 
tcttccctga 
gcactggctc 
gagaatctac 
gcaccgcaac 
caacaacaca 
tccacagcag 
gtcgtggaga 
ctggcagt 



tcaagaatct 
cagcaccaca 
accacgcaca 
caccttccag 
agcccctgtt 
cttccctgcc 
cccagatgca 
atctacaacc 



acaaccttcc-' 

agctcaggca 

acactgtccc 

acccacccag 

gaagaatcta 

agctccacaa 

agtggaacaa 

tcacgcatca 



' .H' *; ' 

acagcagccc 
tcgttgaagc 
ctgccagctc 
cctcaactca 
caacctacca 
cttcgggcca 
caccctcatc 
gtccaggctc 



aggctccact 
atctacacgc 
cacaagccct 
cacgacgcct 
ccgcagccca 
cagtgagaaa 
tgcccactcc 
aactgaaata 



<210> 
<211> 
<212> 
<213> 



13- i 
523 
DNA 
Homo' 



sapien 



'. ' .6 r; > ■•; c 



t - 



<:400> 13* - " 
accacagcat catcccttgg 



i7 * :' 



■60 
120 
180 
240 
300 
360 
; 420 
4 80 
525 



60 
120 
180 
240 
300 
360 
420 
480 
498 



tccagaatat actaccttcc acagccgccc aggctccact 



60 
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5 
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180 
240 
300 



gaaacaacac tcttacctga caacaccaca gcctcaggac tccttgaagc atctacgccc 120 

gtccacagca gcaccagatc gccacacaca acactgtccc ctgccggc^tc tacaacccgt 

cagggagaat\ ctaccacatt ccatagctgg ccaagctcaa aggacactag gcccgcacct 

cctactacca catcagcctt tgttaaacta tctacaactt atcacagcag cccgagctca 

actccaacaa cccacttttc tgccagctcc acaaccttgg gccatagtga ggaatcgaca 360 

ccagtccaca gcagcccagt tgcaactgca acaacacccc cacctgcccg ctccgcgacc 420 

tcaggccatg ttgaagaatc tacagcctac cacaggagcc cgggctcaac tcaaacaatg 4tjU 

cacttccctg aaagctccac aacttcaggc catagtgaag aat 523 



<210> 14 

<211> 461 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (461) 
<223> n = A,T,C or G 



<400> 14 

caggtacaag tcattact.cc cccttctccc . atatgaacaa gaatttttta acggtqagaa 

tatattgggc atcaaattaa aaactttttt ttcaaaagtc tacagaatgg .atattggagc 

aaaaattaca- aagtgggtca gatacaggtt, "tttaaaaact gcattactga atttaacaaa 

agtcagacac- tagaatcata , tatttgctgc , ataaaagt.tg atttgatacc tggtggtgat 

tgaatttagt ctcaaagact cataaataaa aatctgactt aagacgtagt cataccagta 

taccaattct cccatcactt tgactttcgg cagagagatt agagcaaaaa atattcagga 

gaacagtgga gttacattgn attatgtatg tttaatataa tatcaatttt aagggtaagg 

ttaaggaaat. cttaattt.ta aggntaaacc ttgagtacct c _ r 



60 
120 
180 
240 

300 
360 
.420 
'461 



<210> 
<211> 
<212> 
<213> 



15 
508 
DNA 
Homo 



sapien 



<220> 

<221> raisc_feature 
<222> (1) . - . (508) 
<223> n = A,T,C or G 



i 



<400> 15 , , . 

cgcggcgagg- taccagtgtg tgttcgtatt tgggcacagg ctttnggg.gg ccactgcgtt 
gcagntgaca tgtgcccagg ttacagttca tttgcgactt .cgttcctttg ^gtgcacttgt 
tcacacaggc cagcttcccg tccaagacat ccacatagta gaactgggta tatccttcgg 
cagccttctg ggtgcattgc tcctggaagt caaagcccgg agtcaccgat gaatccacga 
aagtgtcctc ttcactatag 6acagtatgg cctttctgca ggaatcagga tcaagaagag 
ttgttctagt ttcattcata .atcttggcct ttacaatctc tgccaggttt tcaaacagtt 
cctcatactc taaagtgtag tctgcctcca ggatgacatc gttcttgacc acgatgctac 
cgttgagcaa tctccgaatg ttcacccctc tatactgagg aagattgtcg cccttcaaaa 
cgacatccat ccgattcttg aagagggt 



60 
120 
,180 
240 
300 
3*60 
420 
.480 
508 



<210> 
<211> 
<212> 
<213> 



16 
578 
DNA 
Homo 



sapien 



<400> 16 .i . . 

acatataaat gaatctggtg ttggggaaac cttcatctga aacccacaga tgtctctggg 
gcagatcccc actgtcctac cagttgccct agcccagact ctgagctgct caccggagtc 



60 
120 



PCTYUSOI/18577 



6 



attgggaagg 
tcacctgggc 
tttccccaac 
caacttcata 
gggagacacja 
£t;cttggt'gc 
gggtgctggg 
tctgcccttt 



aaaagtggag 
cctgcjaggaa 
cacagagaac 
ctggcaggag 

agcctggact 
tctgtgttcc 
tataagcaga 
cctaggccca 



aaatggcaag 
ttcagctcag 
aagagtttgt 
ggtgaggagg 
ctgccccacg 
cagaggcagg 
'gagggatggg 
ggcctgggtt 



tctagagtct 
cttcttccta 
tctgttctgg 
ttcactgagc 
ctgtggccct 
ccjgaggttga 
ttcctgctcc 
tgtacctt 



cagaaactcc 
ggtccaagcc 
gggacagaiga 
tccccagatc 
ggagggtccc 
agaaaggaac 
aagggaccct 



cctgggggtt 
ccccacacct 
aggcgcttcc 
tcccactgcg 
^ggtttgtcag 
ctgggatgag 
t'tgcCtttct 



180 
240 
300 
360 
420 
480 
640 
578 



<210> 
<211> 
<212> 
<213> 



17 
623 
DNA 
Homo 



sapien 



i T 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(1) . . . (623) 
n - A,T,C or 



V 



,<400> 17 
/acacagaagt 
!tggcagagca 
acagcctacc 
tcaggaacag 
taatgccccc 
.ctccatcaaa 
ggcctgaggt 
agggagaagg 
tctaaataag 
tttcatcctg 
gcccctcgta 



1 1: ga a t caca 
^tctCfccgca 
atcaacag t t 
gtagat:tttt 
c'cctcaactt 
gcggcaggcc 
ttagagccgc 
cggaaggggg 
cccaaatggc 
cggttctgga 
cctgccggcg 



aaacat'aatt 
caaactgcaa 
^tgcattata 
aaaaatatat 
ttcc'tttttc 
tacgagcdag 
tttgtgcggg' 
acggacagtt 
atcactcgtc 
accagatctt 
gnc 



a^c^ca J caataa , 
att^at£aa 
'aaciaggtagt 
aHracaagcta 
ttatagaaaar 
'cctfcjaacagg 
gatggtgga'^ 
ctttcttttt 
ttttgctcgg 
gacctgctct 



aacaca'gtgt 

ctac^acagac 

tt'cttrt^ctt 

acacacacag 

tggaaagctt' 

'gtttgccttg 

gctagggtgg 

ctc'tctagct 

tctttgttga 

cggtgaggtt 



tcaagtatx^t j - £ 60 
taaaaactat ir : " j 120 
>t£g^ttttaa^ :/J 1^80 
"c t atcagca'c- - • 24 0 
acaa fcacc t c * 30 0 
gaaaatratgt 360 
gggtgagaaa" 420 
ta'cccttttt " 480 
ttttcttcat 540 
gagcagtcga 600 

623 



<210> 18 
<211> 477 
<212> DNA 
<213> Homo 



V f 



sapien 



<400> 18 

acacaaaagg 

aatattaaag 

cttaattttt 

ttataaaatt 

ttggctttag 

attgctaggt 

catgcacaat 

ataagccttg 

<210> 19 
<211> 374 
<212> DNA 
<213> Homo 



gcatagtcct 

atggatcagg 

gagattttca 

ataacettta 

ttttctttac- 

agaaaaattt 

tttaadtctc 

tgttgcagtc 



sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (374) 
<223> n = A,T,C or G 



acaaagttgt 
gatctcagtt 
tatattgggt 
taataatttt 
tcttcacagt 5 
ctgttcagtt 
tgtagtttct 
aagtttttac 



ttatataatt 
taaggaatcc 
tatag'ctata 
tagtataarfcc 
tttaataggt 
aggagttctt 
tcataa'gcta 
cacarttctat 



gttttatgtg 
tgccttctgt 
fc'atcaggaca 
acttgtgtga 
aactatttta 
attttgctgc 
ttttactatc 
agaccttgct 



tgcaaattga 
atgatgatgt 
ggtaaatacia 
ctataataaa 
caagaataac 
tgaaat^agt 
ttacta^ttt 
gtacctg 



i ■ 



60 
120 
180 
240 
300 
360 
420 
■4-77 



<400> 19 ' 
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agaaacttta gcattggccc agtagtggct tctagctcta aatgtttgcc t ccgccatccc " 60 

tttccacagt atccttcttc cctcctcccc tgtctctggc ~ tgtctcgagc agtctagaag 120 

agtgcatctc cagcctatga aacagctggg tctttggcca taagaagtaa agatttgaag 180 

acagaaggaa . gaaactcagg agtaagcttc tagacccctt cagcttctac acccttctgp 240 

cctctctcca ttgcctgcac cccaccccag ccactcaact cctgcttgtt" tttcctttgg 300 

ccataggaag gtttaccagt agaatcctt'g ctaggttgat gtgggccata cattccttta 360 

ataaaccatt „gngt ^ . ~. 374 

<210> 20 ' " : ' J 

<211> 207 

<212> 'DNA 

<213> Homo sapien 

<400> 20 

acaagtgtgg cctcatcaag ccctgcccag ccaactactt tgcgtttaaa atctgcagtg 60 

gggccgccaa cgtcgtgggc cctactatgt gctttgaaga ccgcatgatc atgagtcctg 120 

tgaaaaacaa tgtgggcaga ggcctaaaca , tcgccctggt gaatggaacc acgggagctg 18 0 

tgctgggaca gaaggcattt gacatgt " .. ~]]\ . ' , . \ 207 

- ., r ■ - ... - . - t > - . ... . ■ ...... ■ .... . . 

<2io> 21 „ . . .. . , , , . .. ... . , . ;. , ..: . '. ' 

<2Xl> 557. /, t , . ' . T .. t . „: ... 

<2i2> dna, ■". : " " ' ■ ; r ' "..'*'' " ' \" ; ; 

<213> Homo sapien . ^ ..,. 7X ...... - ,. v 

<400> 21 

acaaagaatc cctagacgcc atactgagtt ttaagttcct taattcctaa tttaaggctt 60 

ctagtgaagc ctcctcacag taggcttcac taggcccaca gtgcccctag acctctgaca. 120 

atcccaccct agacagactt tattgcaaaa tgcgcctgaa gaggcagatg attcccaag.a 180 

gaactcacca aatcaagaca aatgtcctag atctctagtg tggtagaact atgcacctaa ' ' r 2.40 

acattgctgc aaaat^aaca cacttttaga cacccctgca gatatctaag taagtggaga ' "300 

agactatttt ttcaacaaac attttctctt tcaccctaac tcctaaacag cttactgggg . 360 

cttctgcaag acagaaagat cataattcag aaggtaacca tcgttataga catsaaagttt. 420 

ctggtcaaaa gggttatagt taatgctctg cactttttcc tgcatcttat gca.ttacaat 480 

gtctagtttg ccctctttcc ctgtgtttgt gtcataatag taaaaaatcjt cttctgt'tct 540 

ggggtcatag cacctcg 557 

<210> 22 . . , , 

<2ii> 541 . : /, .... , 4 ' . \ \ 

<2i2> dna : \ ;.; _ ; ' :^ r? ,";V- 

<213> Homo sapien ... ^ . '" , . ; 

<4oo> 22: ^ : ; ^ fi \ : \ : : ' " v ;; J : 

acctaggtgc tagtctcccc, actaactgag . ggaaaaaggt tcccaggtgg .ggtcctctgc 60 

ecactttgcc accacattca cattccaaat gggataatgc ctgaggggcc aagagtggtc 120 

aggctgccct ggggtgaatg tcaccctgat gaggcccatc 1 agctcttgqc ca'ptcagtga' 180 

ggecagactt gtgctctaat ccactctcct gtgggtccct ggcctgtatg gcttatactg 24 0 

gggagctggg cctctgggct gtccaaaccc ."aagggtcaca cttti'gctttt cctttgttgt 3l00 

ccccattttc catccttgct ctaagacaaa acttttccca gagaagaact ctttgttgtc 360 

cccgctcagc tgtaattctg ccttttctac cttcattcca tccttcctct gcccagataa. . 420 

agtccagcag aaattcctcc tttctacctc tctgggactc tgagacagga aatcttcaag. . ,4 80 

gaggagtttt tccctcccca ctattcttat tctcaacccc cagaggaacc aaggctgctg **. 540 

t 541 

<210> 23 
<211> 486 

<212> DNA . 

<213> Homo sapien , [ f 



AVO>01/9<$390 
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PCI7US01/18577 



<400> 23 

acaaaattgt tggaatttag ctaatagaaa aacatagtaa atatttacaa aaacgttgat 
aacattactc aagtcacaca patataacaa tgtagacagg tcttaacaaa gtttacaaat 
tgaaattatg. gagatttccc aaaatgaatc taatagctca ttgctefegca tggttatcaa 
tataacattt aagatcttgg. atcaaatgtt gtccccgagt cttctacaat ccagt6ctct 
tagaaattgg tttctctctt; tgggagattc ag r actcagag gcagccagag gggacaggtc 
aagagctgaa ataatcacat aactactcta attttcttca ttctattgac tgtgtcaagt 
tatagacaca gccaaagtgt ttttcttcgg cctctgatga tttgagaaga tgaagaacat 
gagcaatttc tcattgctta aagaaaaact tggcacataa gaggctgagt gtagtagagt 
atctgt 



60 
120 
180 
240 
300 
360 
420 
480 
486 



<210> 
<211> 
<212> 
<213> 



24 
450 
DNA 
Homo 



sapien 



;<40o> 24 ; t ." 

a,ptgatacat gctataacag ^gatgaactt* cgaaaacatg ctaa£i%aaa ^aaigcca'aat : 

ccaaaaacaa taaaaacaca tattgtatcc tcaccctttt c r gcattttag t^agcaatca 

ttgcatatga atgtttatgg gaaaaatcaa tgtgtgctaa atcattgtat tccagtaaat 

agattggact taaaacttga tacagaagtt gcaaataagt gggattgagt ttgattatta; 

tatagaaaat aattacatga ttcatttaag aataataata tccaccattt attgagcact 

tactatgagc ctgtgtgcca aacatttcat gcatttctca tttaattctc acaataatcc : 

tgtgaggtag aagctattag gttgaatcat atgaacttgc caatatatga taatttctaa' 

gagttgggaa tttttgagga tgtgaatggt 



60 
'120 
180 
.24 0 

[ iob 

360 
' 420 
450 



<210> 
<211> 
<2l2> 
{213 > 

'<C220> 
<221> 
<222> 
<223> 



25 

636; 
dnA 

Homo 



misc 
<D 



'■;:v 



sapien 



feature 



Of: 



n - A, T, C' or G 



<400> 25 

gcaggtacac gtagcgcttc cccgacgtct tgtggatgat gttcttgncg taatagtagc 

gtaagccccg gctcagcttc tcgtagttca tcttgggctt atttttcctc tttccccacc 

ggcgggccac ctcatcgggg tcggcgagct taaactccca tccgtctcca gtccagctga 

tgaatgactg gcaggatttg tctgatagca gctccaggag aaactgccac agctgaatag 

gtccacttcc tgtgaagccg gccagcacag ctgcaggtat aactggtttg ccttgctcca 

ccgggtcact cctctcttgg atgtaatcct tgaaagacat ggttggctta ttgaggcaga 

gagactggct gcagtcatct tcgaagctct j cgaaggaagg aacccgttgc acatccagca 

aggacgactg gctgttccag gactggagga' gggagtctga gctctcgaag ctgtccgcac 

c.gttctcagg ggagtcgtgg tctttgggc^ tcccagaatt gttggtgagc aaattcaagt ■■ 

tgctgcctgg gaagtcctga ctgacagagc ' agtaggtgac gctgaicggag ctgagccgag : 
acttggggaa catctgaaac trictgctcaa agctgagt ' ' : : ■ ' 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
638 



<210> 26 

<211> 4 69 '' 1 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) - - - (469) 
<223> n = A,T,C or 
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<400> 26- ' } "' \ ; ' ' 

naggtaccaa atggagaaaa ct'btttccgg agacgttcat' catcaatacc atcatcaaga 60 

tttttcacat aaagattaac accctggtat ctggtgatcc tatcttgttt catctgttca 120 

aatttgcgct taagttccgt ctgccgttcc acctttttct gagctcgacc aacataaatt 180 

tgttttccat tgagctcctt tccgttcatc tcatccacag ctttctgtgc atcttcatgc 240 

ctttcaaagc ttacaaatcc aaatcctttg gattttccac tttcatcagt cattactttc 300 

acacttaagg caggcccaaa cttgccaaag agatccttaa ggcgctcatc atccatgtct 360 

tctccaaaat tcttgatgta aacattggtg aattcttttg cctagctcca agttcagctt : 420 

ctcgtcttta cgagacttaa atcggccaac aaatactttg cgatcattt 4 69 

<2ii> 364 ' ;? ".-r -•- - ' • ■ ;•• • - • >• / • ;t /*" . \ \ 

<2i2> dna ' 1 ! — ' ; ■■.;>=;• - ; : y" ! " : ' - " ' ' \ r • ; ;' . 

<213> Homo sapien ' • • : - • ; : - • - f •• ' ' ' ' y ; ; 

<400> 27 • ; • . ' ' • •• s • " '•' - ' : . ■ - \ 

actctgctat ggtgctggct tcctttaaac tcaggataga r tgccaggtgg- gctccgtttc 60 

cgtaagactg acactegagc teggcatcag accagttcct cagcttcctg aagtaaccat 120 

agcaattgga cttgtggtaa aaccatccag gagcacagct gggtctcatg atgatatcac 180 

ccaggactcc tgttttggcc aggcagctca gcaataggag cagccgcatg cttctggaag 240 

ccatcttcct cctaccctga ggatgtagct agtgcaagga tctcagagac cttactagcg; ■•■ 3CT0 

cttctttgaa actcctgggt tctccttgat ctgcaaatct gtttggcaac caagactcta;" ' 360 

aggg * 

<210> 28 
<211> 714 

<212> DNA ; ' .. . 
<213> Homo ' sapien : c ■ 



J .J O • ! i. * ... 



' - i , ■ ■ - ■ "' - ' ••• ■-•^ -■.r- 



' sr.;. 



1 



<220> . . t 

< ; 221> mise^feature - n ' • : : ' c ' 1 '■' ' ; ' E - : - ' ' 3,5:t JfD ' J35 ->-" r - r ri : '' 7 f- J 
<222> (i)...{714) r;j£ °^ :n -t ' J " ; r • 

<223> n — A/T,C or G 

<400> 28 

ccttcgagaa gatccctagt gagactttga accgtatcct gggcgaccca gaagccctga 60 
gagacctgct gaacaaccac atcttgaagt cagctatgtg tgctgaagcc atcgttgcgg ■ 120 
ggctgtctgt agagaccctg gagggcacga cactggaggt gggctgcagc ggggacatgc / 180 

tcactatcaa cgggaaggcg atcatctcca ataaagacat cctagccacc aacggggtga 240 

tccactacat tgatgagcta ctcatcccag actcagccaa v gacactattt gaattggctg 300 

cagagtctga tgtgtccaca gccattgacc ttttcagaca agccggcctc ggcaatcatc 360 

tctctggaag tgagcggttg a'ccctcctgg ctcccctgaa'ttctgtattc aaagatggaa 420 

cccctccaati tgatgcccat acaaggaatt tgcttcggaa ccacataa€t aaagaccagc 48<3 

tggcctctaa gtatctgtac catggacaga ccctggaaac tctgggcggc aaaaaactga 540 

gagtttttgt ttatcgtaat agcctctgca ttgagaacag ctgcatcgcg ; gcccacgaca 600 

agagggggag gtacgggacc ctgttcacga tggaccgggt gctgaccccc ccaatggggg 660 

actgtcattg gatgtcctga agggagacaa tcgctttnca tgctggtagc tggc 714 

<210> 29 
<211> 373 
<212> DNA 

<213> Homo sapien \ 

<400> 29'- - ;: ' ■= 11 v " ; ' ; ' ' ' •• : -' • ■ ' * . 

acttgagatc cacagtcacg ' tgaactttgc : cggtctcttt ac^tctgccc actt6atttt 60 

cattctttcc ttcccacaca atggtttttc caatg1:gcaa gaatgatttc tcgacaaatt i 12 9 
cccggacact atggacctcc ccagtagcta taacgaaagb cttccggtca : tcattctgca 1 180 
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acatcaacca catagcctcc acatagtcct tggcatggcc ccaatctcgt ttggcatcca 
gatttcccaa actgaaacat tccagttgtc caaggtaaat cttagctact gaccggctaa 
tttttcgagt aacgaaatta gcttctcttc tgggactctc atgatbgaag a^aatgccgt 
cactgcaaag. aga . - , : :r 



<210> 
<211> 
<212> 
<213> 



30 
485 
DNA 
Homo 



240 
300 
360 
373 



sapien- 



<400> 30 

aaaactacga ctcagcatac attttcccac atacattttt acattgtacc ttaggactca 

gtcatctcca cttaaattga tgacacaagc agctaataac catttctggg tttctgccta 

accccctaat tgtctgttaa agccaattct ctgggtgtcc cagtgagtgg tggcttttttv 

tctttccaca ttggcacatt cacttctccc actcttggca tgtaagaaat aagca;tt;tac 

ataattggaa aaatctggat ttctgatgcc aaagggttaa agcttcttgg atttcatttc 

attgatatac agccactatt ttatttttga tcagtggcct ttgggccact gttcagggta.- 
etgaccatca, gtgtcagcaji-^taggg^ttijt^r _#t£tf tgfct;t cttttgggtc . tfctctttttt 

.gg A cacatgtg, : aatcttgttt : ,^g^ t taaaat gajaattacfct , tctcttgttG:: tctgatgatg- 
rggttt . : ~ } ,: V ... = .. 



;: o u 



, A\ ";■> -<~\ j 



i ; 



". V (*' )*.-..;:. ' '..>_j 



<2d0> 31 . - * c .... 

,<2dl> 342 V \ ^' -' ; j"' f 

■<;212> DNA " " "" '*"' ' 
<213> Homo sapien 

<400> 31 

acacattaag catccccagt tcccctcgca cacccctttt cccagccact agtaaccatc 

cttctactct ctatatccat gagttcaatt gttttgactt ttagatcccg caaa^aattg 

agaacatgca atgtttgtct gtttctggct tatgtcactt aatatagtga cctctagttc 

catccatgac tccttaactg cccctgaatt tttgacacta ttatttttaa gtattttgga 

aaactcacac ctgttctcat ttttaaacct taataataac aatttcctac taagctaata 

aaacttcccc ttatattatt tgtaatgtgt gcataacata gt fx 1 *, 



<210> 32 
<211> 331 
<212> DNA 
<213> Homo 



i.>: 



sapien 



<400> 32 : . r r : .' . ; p . . , . , .,. . ^ 

acagtatgtg gcatttccag gtatgactga gtgtgagaga catgtca,gag gctqttcagt 

gatttcttgc tattgaccga tgcttcactg tgccaaaaga ^gaaaaa'aaat gttgggtttt 

gtaattaaat tatttatata . tttttgaaac ccgaattgaa v^atgttgcag gcaacgggct 

acagctttat tagtggttct ctaactgtgg tctccttggg ^ccaagcaatt tctttaaagg 

aaaagttgat : tatgtatgtg gagtgccagg accacigcct tgaaagcaag tgtgattttt 

atttttaata ttattttatt "tgtgtctgtg- 1 . .'. ; . r , ... 



<2io> 33 1 * ; ! 

<211> 381 
<212> DNA 
<213> Homo sapien 

<400> 33 

acactgttgg tgttatatgg ggatggggtt ctcggtaatt ttgtttatta tttatgttta 
ttattatgtt ttatcattaa ttattcaata aatttttatt taaaaagtca ccctacttag 
aaatcttctg tgggggtggg .agggacaa^a gattacaa'ac caaaactcag gagatggtaa 
cactggaatt .gataaaatca. :cctgggatta gttgtataac t'ctgaaiccac 'dcaaacctc 
ttatcaagcc ttgctacagt catggct:gtc. gagaaagatt, 'tacajgttatt : tt;tctgagaa 



60 
,120 
180 
. 240 
300 
;360 
.,420 
,480 
485 

": \ t- ' ' 
- - r 



* 6 0 
120;. 
180* 
240 
300 
:342^' 



60 
12Q 
180 
.240 
300 
331 



60 
120 
180 
240- 
300 
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aggatccatg ggbtttaaga acttcagaac tttaagaact tcagaagttc ttaagttgct 
gaagctcaag taacgaagtt g "' : 



<220> 31 3 ■ ■ : - : ■ '* 
<221> misc_f eature 
<222> (lV. . . (567) ' ; 
<223> n = A, T, C or G 



<210> 36 

<211> 265 • ! 

i'212> DMA 

<213> Homo sapien 



;360 
381 



60 



<210> 34 ' • ' : ■ •• ■ ' 5 

<211> 315 f ' '• - 1 
<212> DNA 
<213> Homo sapien 

<400> 34 ^ ^ 

acgaaactgt atgattaagc aacacaagac accttttgta tttaaaacct tgatttaaaa 

tatcacccct tgaggctttt ttttagtaaa tccttattta tatatcagtt ataattattc 120 

cactcaatat gtgatttttg tgaagttacc tcttacattt tcccagtaat ttgtggagga 180 

ctttgaataa tggaatctat attggaatct gtatcagaaa gattctagct -attattttct 240 

ttaaagaatg ctgggtgttg catttctgga ccctccactt caatctgaga agacaatatg 
tttctaaaaa ttggt 

<210> 35 

<211> 567 : J" r - : ... ... . .1 . • • ^— : - 

<2i2> dna • <• "• - . y y •• ■ ' " }:: - : ;; ~ : :[ . .' : ' : . : 

<213> Hdmb : sapien ;7, ° : J ' '; J : ' . J ; ,. 



300 
315 



60 



<400> 35 4- 4-4-4-4- 

tacttcttaa aanacatata acacaatgtg gtagtagtag gtgtaaggaa ggtaagtttt 

ttcatagtgg tatgcaaaca tatcattgaa atattacata gatataaaga cttagggaat ;^20 

aaaaatagca gcaacaaata cttgatagat ttatcctact tgggagaaat atttfcgtagc 180 

agagtattta gtatacttag aagttgattt agcaattagg ctttaatgac cttacaaagt 240 

gaacataact gaacacaagt attttttcaa tgcaagatga ggatgaaaat tt^acatttc - -300 

aacccatctg : ^ctaaag t ta ( agac€tagca waaaatfcaaaa -egtfcgccttt: gt ccaagtat 360 

aga 1 1 a a a a ea aa cat a tt'tgggtgtg ; 'taatttgaag: ttttggactg aaatatcttt 420 

gcaagtatcef acataaaatt 1 ctgtaatgcc ttataattat atitctaataa- ttatgcatta 480 

tactaagaca ccattaagaa cagttgangc actacactaa atcaaaccat aaatgaggaa -540 
aaaactttta' atggtc'tttt ctagaag ' : 1 ' 5 - 



567 



<400> 36 ^ ' 

acaagtggtg gccacagaag taggggggtc ttccttaagc tctgtgtcag agttccacct 60 
gatccttatg gatgtgaatg acaaccctcc caggctagcc aaggactaca cgggcttgtt 1 120 
cttctgccat cccctcagtg cacctggaag tctcattttc gaggctactg atgatgatca 
gcacttattt cggggtcccc attttacatt ttccctcggc agtggaagct tacaaaacga 
ctgggaagtt tccaaaatca atggt 



180 
240 
265 



<210> 37 

<211> 476 

<212> DNA { ^ . - 

<213> Homo "sapien 1 " ■ ' - ~ :i ! - 4 .' ' " "' 

, ^ .-. ' ■ - . * . * : r - ■ ' ■ J ■ .• . . -■ > i 

<400> 3T ^ ' ' * ■ *■■ ' : , ^ . . . 

actgtatgtg ttttgttaat = * tctataaagg -tatctgttag atattaaagg tgagaattag 
ggcaggttaa tcaaaaatgg ggaaggggaa atggtaacca aaaagtaacc ccatggtaag 120 



60 
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gtttatatga 
aacccctctg 
attgaagatt 
aaaaatactc 
cctcaaatct 
aaacaatgtc 



gtatatgtga 
attggctatt 
catcccatac 
aagatcattt 
gagggacttt 
atttagtaga 



atatagagct 
attactatat 
ttctatatac 
atatttattt 
taagaaatgc 
atatttcagt 



aggaaaaaaa 
ttattattat 
catgcttaaa 
ggagagaaaa 
taacagattt 
atttaagtgg 



gcccccccaa 
ttattgaaac 
aatcacgtca 
ctgtcctaat 
ttctggagga 
aatttcagta 



ataccttttt 
cttagggaag 
ttctttaaac 
ttagaatttc 
aatttagaca 
tactgt 



<210> 
<211> 
<212> 
<213> 



38 
424 
DNA 
Homo 



sapien' 



<220> ; r.. 

<221> misc^feature 

<222> (1) ,:,t.:C424h 

<223> n = A, T, C or 



;i "*• V £ : 



180 
240 
300 
360 
420 
476 



<400> 38 

tacaagaacc 

tnantggaaa 

cagttggact 

tgtaaaatat 

gtctactttt 

gttggttcag 

tttgggggac 

catc 



tcactcactg 
cctgattctg 
gatcatctct 
atatatgtgt 
tgcaagttac 
tttctggggt 
tacttttctc 



gacattgann 
gcagctcatt 
gagggccttg 
atatatataa 

tgggtttctg 

catctgagta 
atgcaaacag 



ttctactgtc 
tatcttggtt 
aagccctaac 
tttcagctga 
tcttcaccat 
gagagattct 

gggagctcct 



caatcccaac 
tcctcatttg 
aagtctatca 
gaagtgagtc 
cttctgaaaa 
gaaacagaca 
ancaatcctg 



tnactgctgt, 

taaggtcgtt^ 

tgatcc^caga 

ttcacaccaa 

gtctgcttct 

■ctgatgttaa 

agagcpgctg 



<210> 
<211> 

:<212> 

o <; 

<4Q0> 



39 

4 93 , 
DNA . 
Homo 



T 



; sapjLen : 



39 



■■•P. 



a ca 1 1 g t a g,c ,-ce t c tgce fecr at 6q c fc£a. 
ttgtttcaca jtga^t-fetcagg ga£catige,aa 
ctgtaaagca ;-:gat£caagta- tcectqacc/t 
a c t c 1 1 ca a g t ^t tea accac.t- . : gt t a a ga cg t 
aattgcacag taggatgtgg aacctgttta 
atcaaaaagg tctcaccaca taccatgtgg 
cccctgtttg catgagaaaa gtagtccccc 
ctctactcag atgaccccag aaactgaacc 
aagacagaaa ccc 



^aeagctgcat^ 
: ,agaaGgctqt 
Gaaagaaaca 
cctattgagt 
atgttatgag 
atgcagcacc 
aaattaacat 
aacagaaagc 



.1.1* J ; 



ctocaggaaa 
tjttccaggtc 
gacgttgtotg 
tctcaggatt 
cagtgtctgt 
agacttttca 



^tcta^tact ; , 
ag^^cb^ca^. 
tQcagcjtctt;' ; + 
ctcagatggg/;;, 
ttatttccta^ . 
gctaggagct \ 
ttcagaatct, 
gaagatggtg 



<210> 
<211> 
<212> 
<213> 



40 
464 
DNA 
Homo 



sapien 



, 51. ■ 



<220> , 
<221> misc_feature 
<222> (1) . . . (464) 
<223> n = A,T,C or 



. 60 
120 
180 
240 
300 
360 
420 
,424 



6:p 

420 
3.80 
240 

300; 

360 
420 
480 
493 



<400> 40 



acaaaacaca caaacatcac tttacttgga aaattatttt catcatactg taaacatatc t 
ttcccctaca tctggacatt ttgaaatagt ctttggtatt actagttatt gtgctttgaa' 
acagaaactt gcagaatttc tgtagtagtg ctacataaag atataaataa gaaaaatgca 
.qttggaa^aa gttacat^ta -gptgottttg^cataa^tttQ aaa.aactaca gtgtatgcct 



60 
120 
180 
240 
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0 ^y/fJS v 01/18577 



agtcacagtt ttatgagaaa gaatatttcc tttttcaact taattttaag gaacacttaa 
tcattttggd taagtatcca tttttggagt ggatctgatg agttgcatga cactaaactt 
ggatgctctc catttgctga aaggcacatt tttaagaatg gattgnatag aagttgatcc 
ttctggatct cccatatctg ctctccagtg acaactgnct tgtg , 



300 
360 
420 
464 



<210> 

<2ii> 

<212> 
<213> 



41 

557 
DNA . 
Homo 



sapien 



<220> 

<221> miscjE eature 
<222> (1) . . . (557) 
<223> n = A,T,C or 



acagtgatag gtatctttct ttggagtttt ttttttgngc atatgtgtat agttttatgg 
gttctgagtt ggtgaccana aagttgcatg tagngctggc acttacttaa taactattca 
tgatattgtt aataacttgt; tataggattg tattcccaat tacagtctct aanattgtaa 
ttgatattat .ctganaggnV gngngacaac; tttcttttgt tgttacatta agfcc^aaa^c 
ataatactaa tagacaacta acagtttgct / t^tcaggcat: atcaactaao^ gcacc^cccc; 
ccatgctaag tttctcctgg atatatggaa gttgattgtt tcccagttria aaaacttgaa 
ctaatatctc ctaaaaaaat ctgagtccat attgttttta ttttacttag ctanaatctc 
atagcangtt aaagtcatat ccttatcccc actaaaaata actatgtnta tgtgagagga 
atatagtatg tgggagctgt attaaatact attacaggtg ttacagaatc tttaaataaa 
tggacatgga ccaactt 



60 
120 
180 
'240 
300 
360 
420 
480 
540 
557 



<210> 42 
<211> 255 

<2i2> dna : ; . 

<213> Homo sapien 



<400> 42, r .'_„. - 1= '■: J-C-r: - «••,-; . ; , 0 " ^ 

ac^atcag^c;tttgtgctga tt t.cct gaac; aaactgcat t; at al: ta t gaa aacaaaagga 

aaagaagaaa ^aaiaaaaac, tataptccca^ tatttcactt ' acagtgtttg agttcctgga 

aggacctata' 1 tak'tggaggc agcattcaaa caagaaatta tgccaatcaa ctgtcaaatt 

ttcactataa ttttcptaaa aaggcg^ttt tcpcccaata tctattaatc tcaaagaaac 
ataagttgtg aatgc , ., ■ .= 



60 
120 
180 
240 
255 



<210> 43 

<211> 349 

<212> DNA 

<213> Homo sapien 



<400> 43 ^ . . 

actccagcag atttaatatt ggcatccatc atctagtcaa acctctcaca tgttcttcaa 

atgaatcaaa tttgggattc tcaacatttt ctgtgtcaaf aaaaggtgtg ^gaattagtag^ 

attcgatgaa.gapctgtttt tccttgccac attggacttc cagacgccat ttggattggg 

tttagaagat ggggaaattt agaagacgtt tcttggcctg agtctcttaagagtagagat 

gcagaagaga : gagtgagacc aqgaagaga.c J tggc jtgttga ctgca^ggca ccaccagccg 

ccttggtggt ggcattagtt ggatttgggg ccaacccaga' gttggaagt , , ; 



60 
120 
180 
240 
300 
349 



<210> 
<211> 
<212> 
<213> 



44 
483 
DNA 
Homo 



sapien 



<400> 44 



t i 
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accaaaccat 
gttttataaa 
cgttttttgc 
agcttggaca 
gactttttaa 
ttgggagact 
cgttgcactt 
agcctgactg 
agt 



tttatgagti 
aaattatat't 
t^ctatggtc 
gaaagaaaag 
ttttatatat 
ctgcattttt 
cttttttggg 
ttttataatt 



ttctgtta.cjc 
tttattcagt 
ttagcctgta 
aaacttggtg 
atacattttt 
tattgtggtt 
agatgtgtgt 
tgggagttct 



ttgctttaaa 
aatttaattt 

< ■ 

gacatgctgc 
ttaggtaatt 
tttccttctg 
tttttgttat 
tgttgatgtt 
gcatttgatc 



aattattact 
tgtaaatgcc 
tagt^tcaga 
gactatgcac 
caatacattt 
tgttggttta 
ctatgttttg 
cgcatcccct 



gtaagaaata 
aaatgaaaaa 
ggggcagtag 
tagtat'ttca 
gaaaacttgt 
tacaagcatg 
ttttgagtgt 
gtggtttcta 



<210> 
<211> 
<212> 
<213> 



45 
281 
DNA 
Homo 



sapien 



<400> 45 

acat cgagaa f tccacgcccg gggacpagta ' c 

ctgctgcgag.. ggaaggacpa. c.gtggtct pa gatttctcag ; a^catgga^" £ttaaaatat 
cttcatgaga. apc.tccct'a]:/tcctbaaaaa"'aacaccaaftt W^aaanannn 1 sVci^st 



gagg gactgcttac" tactaagt^cf 
tcag ajgcatgga^; ^ttaaSatat ^ 

^ tj . tea gaga" ^acaccaact ga'aaa^agee"" ag^aaa'aedd 

g^gaat t ttc ; .' ^aaaagcrtci^ tpaQg.tta^.; cttgtcttat" ctcaggagag "adbccgdtct 
tgtctcccag ttcctg^ ' ^ % " o : 



<210> 46 

<?li> 587; T ; 

<£l2> DNA 

<213> Homo sapien 



<400> 46 

acagcccggc 

atttgecate 

tggagggaaa 

cttctagtca 

cacagcccct K 

agaaaagcac^ 

ctcttggact; 

gtgatacgga 

cttatttctc 

ggtagttata 



ctcccttgat 

aagecattte 

aagaaatacg 

catcagcagt 

gcagggaaag 

a cg.ca.ti a g t g^ 

ctcggcaacc 

ggttagga.tig : 

aaaaaagagg 

tatacgataa 



gcatttggcg 

ccccttttgt 

gcctcaacag 

gttctgeett 

gfctaajbctta., 

aaatcftcag't [ 

acaa'caiaat 

ptacttgetg 

gacagtgaaa 

gttgtatata 



cgttcctgaa 
ttctaaaatt 
attaaaaaac 
tatgtagtag 
eggrataatec 
etttt'eag'ta 
agttgaaaga 
caaacaagcc 
acaaaaacga 
tgatcactgg 



aagttgtgtg 
atttcagaga 
aaaagtcaca 
ttgggcatat 
aegtgagatt 
atgacfyata-c J 

tca^gattcf<j' 
ctadtt'tgcfc' 
cattgggaca' : 
tagecta 



taaaggaaga 
tgtgtgctcc 
cttaag'gatc 1 
aatccttcca 




'ctcc^cgaSa 
ca&catcctig 
tgctgctcaa 



<210> 
<211> 
<212> 
<213> 



47 
317 
DNA 
Homo 



sapien 



<400> 47 

gaggactctg 

ttgtcgtttt 

tctggttaca; 

taaaagtagg 

cttagtccat 

ctcctaaaag 



acagecataa qaggagtgcc acttcatggt gcgaagtgaa'; cactgtagtc 

cccaaagaga actcegtatg ttctcttagg" ttgagtaacc cactct^aat 

tgtg.t:ttttc tctccctcct taaataaaga gaggggtta'a acatgccctc 

tggttttgaa. gagaataaat tcatcagata-' acctcaagtc aeatgagaat ' ' : 

ttacattgee 't'tjggctagta aaagecatet atgtat^tjt cttacctcafc' 
gcagagt , yn . . . , 



<210> 
<211> 
<212> 
<213> 



48 
512 
DNA 
Homo 



r ■ "i 



6b 

120 
180 
240 
300 
360 
420. 
480 
483 



60 
120 
18 0 
24 0 
201 



60 
.120 
180 
240 
-300 
360 
.4'2a 
"4 80 
"54 0 
'587 



60 
120 
180 
240 
300 
317 



sapien 



<400> 48 
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acacttgtat ggcttttcac cagtgtgagt cctcaggtga gcttttaaat gagaagactt' 
ggtataaact tttgtgcaac cagggtaatc gcagtagtgg atgcgtcgtt tctccaaatc 
ggggttactc cttctattgt atctgacagg ttggatgttt tgtgagttaa ctggcagggt 
ggtgggtaaa tttggattgt. gaattgccag tttagaagca attgtagcag cataggatgg 
aggtggggtt aaattctgga gcatctctgc . ttgtctatct ggacttccag gctctgagct 
tggtggtgac Igggggaaagt aagtggcctg ttgtggaaga aactgacttg gcattgtgta 
tgtgcaaggg ggcatgccct; ggaattgttt cactgcagtc tgcggaacag cagaggtgtg 
tgtgttaagg cdtgccatgg cagctgacat agaaacatta agagtgtcca ttgctgctgt 
ctgatttgta gaactgggca tatctagatc . pg " M J 



60 
120 
180 
240 
300 
360 
42Q 
480 
512 



<210> 
<211> 
<212> 
<213> 



49 
454 
DNA 
Homo 



sapien 



<220> 

<221> misc^feature 
<222> (1) :..,(454) - 
<223> n -i,T,^pr G 



J 



<400> 49, " ■■ • ( , . ■; ■/ , . , , .... .. 

acaggattca ctaactgttt cgaatgaagc ccaaactgcc ...aaggagttta ttaaaatcat , 
agagaatgca "gaaaatgagt.. atbagacagc aattagtgaa aactatcaaa caatgtcaga 
taccacattc aaggccttgc g.ccggcagct tccagttacc, cgcaccaaaa tcgactggaa 
caagatactc "agctacaaga ttggcaaaga aatgcagaat gcttaaaggc tgaatgtagg 
attcttcagt atgtggaaag acaaggattc aacgtgtggt catatgataa ataagtgatt 
tataaacaag agtgatattt tgctagggct ttcaaagtta accggttttc tagcctcatg 
gaatactgtt gaacctatag, cgttgtcttg attcttttgt gttctctgcc ttgtaatttt 
ctgttactgc tatatctacg tgtaaatctt tntt 



; 60 

' 120 
.180 
240 
300 
360 
- 420 
4 54 



<210> 
<211> 
<212> 
<213> 



50 
374 
DNA 
Homo 



sapien 



<400> 50 

actatcccat gttgcgcagt aatagatggc ctcgtcccca gtccggagtc cggtgatggc 
cagggcggct gacgtgccag acttggtggc agagaatcgg tcaggaattt ctgagggacg 
gccatcattg tgataaatga ggagtttggg ggctgttcct gagaattgta gataccacga 
cacataatta gttccaatgt tggaggcgct ^ccagagcag gacatggaga ccttctgtcq 
tggggccgca gagactgagg gcggctgcgt caagatggac ; tgggcccagg accctgtgca 
gtgaatgaga agggtgagga j ggagagggga gcaggtcatg atgaagattg tccc.gagtcc 
tgccttctgc gctc ; \ . , } = * 



■i r 



60 
120 
180 
240 
300 
360 
374 



<210> 51 

<211> 250 

<212> DNA . . , " , . 

<213> Homo sapien : 



) :■ 



<400> 51 

accagatatt ttctatactg caggatttct gatgacattg aaagacttta aacagcctta 
gtaaattatc tttctaatgc tctgtgaggc caaacattta tgttcagatt gaaatttaaa 
ttaatatcat tcaaaaggaa acaaaaaatg ttgagtttta aaaatcagga ttgacttttt 
tctccaaaac catacattta tgggcaaatt gtgttcttta tcacttccga gcaaatactc 
agatttaaaa 



60 
120 
180 
240 
250 



<210> 
<211> 



52 
351 
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<212> DNA 

<213> Homo sap'ieri - ■ .J 

<400> 52 ; r ' ; ; - \" J. 

acgaaagggt' |:t£taccaat: attcacta&g. tattatgca^ t^tttatatc ' tttfegtatgt 60 

aaaactttaa, ptgatttc^g tcattcatca 'atgagtagaa^ gtaaatacat' tatagttgat • 120 

tttgctaaat cttaatttaa aagcctcatt ttcctagaaa '; tctaattatt ? cagttattca ' 180 

tgacaatatt ; tttttaaaag taagaaattc tgagttgtct J tctt^gagct gtiaggtcttg ' ' 240 

aagcagcaac' gtctttcagg ggttggagab agaaacccat tctccaatct cagtagttt't * ' 300 

ttcgaaaggc tgtgatcatt tattgatcgt gatatgactt gfctactaggg t- : : " 351 

<210> 53 

<211> 546 : -. •'• ». 

<212> DNA ' v - ^ 

<213> Homo sapien ■ 

<400> 53 

acatggacat tctgcaaacc cagctgtcac atttttcttg caactccttt tgcaasfegca '■' ^60 

gactaaaatg ttttaaaatg tgaaaaaaca ttattttttc aaagcaagaa aataatttac 120 

tgccctctta cataatgtat ttataaagtt tttccagata aactaatcaa ataa'attaga : 180 

ataatgtgac aacattacaa atttaatttg ttagctgcat tccttctgat gttaccacga 240 

tagaatgtta ctgatgattc agggctattt ctgaagtctg tatgttgctg ctgtccccag : ' 300 

tgatggtgga cttatctttg ccttacctga 1 ^cab'aaatta' t^fttg^rggaa aataaagfatt ' J l - 360 

taatatttct ttaaatagaa aaagaaltttg ; £tittg<5tc£ tttaagagca;atgagaaaa£ : 420 ; 

gatggaatgt^ t^actgt^tt tggcacacag; gacacggacc* ttcatggaag tccttgc'tdt r 480 f 

g.cgtggcatc' tgtcagcttt" tcacctttca ttcttattct tcacttttgt:' tgctgagcct 540 * 

agctgt . \ . ; r ■ ■ ; ■ - ■ / i , ' .- 

<210> 54 ■ ' " ; ■ ■ * ■ > . r- 

<211> 631 : '■" • ' ' ' ,'- 

<212> DNA 

< 2 1 3 > H omo s api en 

<220> • * -.7 : \. 

<221> misc_feature -.^^y-, or:.y ii > 

<222> (1) . . . (631) 

<223> n — A, T/ C or G ■ - 

<400> 54 .' r ' • - * J", : .> . . ,„ f - . 

acngttttaa ccaatacnha riaa^cantaa agcaataata tctgaagcat ; tatttaagaa 60 

atctcaatad acgatdtctfg 7 aagttcctaa" aattctggca ctaattctaa tgtgaactta 120 

gtagcaaaag acccagaaat agtaagccct tgacctaaaa actaactgat ttgtat'gata * 180 

ttcatgcaga aacaatgatg aaatggagtc aagttttcta gtgtcattgt tatcaaaata - 24 0 

actgtcaaaa tagtaagttt gaaacttaaa tgagcacaaa ataaaatttt gttttctaac 300 

aagaccagat ttctttttaa aaataattct gagttagaca aagtgatttt cctaaaagct 360 

agctgaagct accttaaata tcccctattt taagttacag catctctaaa taagttaatc - 420 

acacaagata gtttaaatac acctttaggt gtaggggagg ggagaagcgc ctctttttct' 480 

aatgcagctg ttttaatttg aagcttttgc acaaaatcag atagaaacat taatgcctaa" 540 

ctcataatga cccttgatta cttgtaattt tggactagaa ataatgtggc tttgaacatg : 600 

ccagtgttag accatactga cttaaaaaaa t 631 

<210> 55 r ■ _ ;.i v r . ; .. : . ; .7.- 

<211> 4 08 ■ ' : — ' / » 1 • - v- 

<212> DNA ^ ' ■ ^T- ' - • . - 

<213> Hoiko sapien 1 ''' ^^ 'i : ->\ : u < U - .. y •. . ■; , ; 

<400> 55 

accaatatat ccccagaaag aattgcaatt taccaaggtt ttcacgtgtt ttgagagaaa * 60 
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tcttactgaa agactagtga tgtccatttt ccagtaaata ctgagcgaaa aacaattttt 

ataccccaat ctgaggtata aacttgcttt ttgtgggatc acaactgctg taaattagac ' 

aattgtagca acaatccaag acaataacag aatgcctatg acagtctgcc atattctggt' s 

gagtgtctat caaagctcat catgattttt tgtgagatct tccccgtaat tggtagcttg* ; 

gcttccaaca aacatgttcc agttctccaa tatttcctct ttagttagct t'ctcatcctt 

gtttttgtct gattcatata ccagatgcct ggcctcagcc tgtgcgtg 



120 
180 
240 
300 
360 
408 



<210> 
<211> 
<212> 
<213> 



56 - ■ 
567 ! 
DNA 
Homo 



; .1 



sapxen 



3 . . 



<400> 56 ••• •• ; - ::r ' '• ' ; ^ •' *'*'■' ; " •• ' ' ' ; ' \ 

actgtgggtc gaagt aatgg ■/ atacggacgt r aStbcat^cttc gccgecgctg - ctgtagctct j 

tgccatcagg atggaaggca acactgttga taggtccaaa gtgacccttg actettccaa 

actcttcttc aaaggccaaa tggaagaacc tggcctcaaa cttgccaatc ctggtggagg 

ttgtggttac atccatggct tcctgaccac cgcccaggac cacatggtca tagttggggg 

agagggcagc tgagttgaca ggacgttctg tccggaaagt cttctgatgt tcaagagttg- 

tggagtcaaa aagcttggct gtgttgtcct tggacgcggt cacaaacatg gtcatgtccei 

tggataactg gatgtcgttg atctgccggg agtgctcctt aacattcacc aaeacctctG- 

cagacttggc actatactgg ttgagctctc cactctcatg gccagcgatg atgcactccc 

ccaggggtcc ccaaacagca ctggtgattt tagagtcatt gcaagggatc ttcatgtagg 1 

gctcattggt gtcaatctgg ; etcggat ' - : • - * : " ' ; •-' ' ' ; 71 



60 
120 
180 
240 
300 
360 
420 
480 
540 
567 



<210> 
<211> 
<212> 
<213> 



57 ■> 
411 - 
DNA 
Homo 



sapien 



<400> 57 

acccttcctt gtccgaagga gctgaccagt attgatgaga gagtccaggc agctcctgaa 

gttcagctgg tagtttgttc tctgaacatt tggtctcttg aaggcacagt ata£ctgggg 

cttcttcctt tacccaatct aatcctttct tcttaatcca ggctcgaagc ccatccacat 

tccaagagca gatcttgagt gtggcaggtt tgccactggg tgaggttttc tgatetgggg 

ggtcctcata cagggctggg ccctctcctg ctgcctcttt gtcatttttc tttgcggccg 

tcttactctt cttggcctct ggctctgtcc tgagctcatc cccgtcttcc gccaccgctc 

cctttttccc acgettcggc attcccgtta cgaacgecct tgggcagctg^t - 



3 



60 
120 
180 
240 
300 
360 
411 



<210> 
<211> 
<212> 
<213> 



58 
589 
DNA 
Homo 



sapieri 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(1) . . . (589) 
n — A, T, C ox G 



<400> 58 

acattaatac aaacatactt ' gcagtetgag cgaagaiggg aatggaggct: gaggaggtca 

aaggacgaaa ggtc^gccct aaagacagggM tgttttgtta ttatggtaat tacaccttca 

taccttctat: .aatattcatt gacagacggt ..gacatcaaca ggtgtagttt atcatgttct 

gtgtagagaa ctaaactacc ctactgtatt tgccatgccc ccaattccaa gaaaacggca 

aaaaattagc ccatcccatt cctcatcaca aagatcttaa ctgcacccct gcaacacaag 

acttttccaa taggacaaaa cttcaaacag cattgtatac caaatgattg cggatcaaaa • 

ttaaatttac aggaacacaa tactgaagca ctccactgtt gctgtaaaaa ctgctggaaa 

cagaatctgt caactggcca aattttatcc ttaattatta tccaaacagc cgtcctcttc 

acatctatcc ggatgatgct aatctactac cctgtccact aggttagcaa gttgtaggaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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caactcttca ccatttctcc caccctaaga ggtacctgcc cnggcggnc 



<210> 
<211> 
<212> 
<213> 



59 

440 v 

DNA 

Homo 



sapien 



<400> 59 

acatgaggca gttgagcagc actggagaac cttcacggtc cacacggaac tccccagttg 

gagtataata gtcattctcc ttgatatgtt tgcctgtatc tgtgctccct ccaatccgga 

ccatccaaag aaacttgttg atatcatcag aggaataccc agtgaggcct ccaaaaatga 

ccagcacata gctgacatcg agctccctca tgatctcata ggctttttcc tctgtggacg •> 

ccattgcctg ccctactcga gaaatatggg tattattcca tgtgttattg tccactaaaa 

ttgttcggtt tgccatagct gtaatctgat agccataatc ccaccaggac atgaccttcg. 

catcctctgg agtattatga c.gaagQcaat ; aa£atgcttG tcggaagtea tcaaatatga- 

tc.ctactgcc , atc.Qccacca;, :> : . . •- , ••: ; -. ? - — —.;■,.-£' 0 :>■■:<' --~ ■■ 



<210> 

<Z11> 
<2X2> 
<213> 



60, 

41£; : 

DNA ;. ; 
Homo 



sapien 



acctggaaga tcaagatcta cagctgccta tttccacatc t.titcaatcca; jtcfcggctcct 

taaatagggg aaaaagccct tatttggtgg agaagcattt ccaaaatgaa gttacaggtt 

ctattaaaac ttactgtcac atcaactgtt aaaatagggc cttttgtgtt ttgttatttc 

accttaatat caccagaatt cctgtaattc cacaattgtg attttactat gtagaagata 

attcagttct agtctattgc tttagatgta aaaacagctg aaaacccaaa gtggattaga. 

attgctgaag gatttccctg ccgttgtttg atacaatcta ttctcttgat tcttgatagg 

tgcatagaaa.gcctaactta aaattctttc tacaggaaca tgtctgattt caggagt 



£210> e:u 
<211> 354, 
<212> DNA i 
<213> Homo 



If J 1 



i 



sapien 



! t.9- 



<400> 61 .. . . j f . : . 

acctcctgtg ttgcagagtt tctfcfcatcca oa.tcoaccca accagcagca tcagecacag 

gactggtctt gaggacatct ggtgggctca ttggaggtgt gacatgaagg atttcatatg 

aaatcacttg ggtctctcct ggtttgtcca ggttctcaaa tacagcctct tgtttatcgg 

ctcggacttc aatgaggttt ttcttgtagt taacagtgag gttccgctcc tggatgatct 

cctgcagggc atctgcatac ttcttaaccc cgaaaatggc tccaagagaa gtgttgaaaa 

tgatattggc cttggatcgc ttccctgtct tcctgaagta ggcttctgat aagt 

<210> 62 

<211> 205 

<212> DNA 

<213> Homo sapien 



<400> 62 

acccccttcc-acttcg^ctc ccctagctcc ^agaagcaac^cactgatgtg atttc.tacca 
aatccagttt tggtGCtact -aaatatactc ttttgagaetv^ggectctttt-aGteaGcata 
atgcctt tgt . aatt catcqa tgctgttgt^g tgtatcagca i'gttfegttCGt "tttcattgct 
gagtagtatt rjctattgtaga gatgt ->:y> j r ; • < . = • ; 



589 



60 
120 
.180 
24.0 
300 
360 
42Q 
4 40 



<210> 63 

< 2-1.1 > 325 ; 
<212> DNA : 



t ; 



n 



: J 'a .: r: J \ . • 



<6a 

120 
180 
240 
30,0 
360 
417 



• .6 0 

120* 

180 

240 

300- 

354- 



i 1 >■ 



60 

■lzo 

.:180; 
205 
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<210> 65 

<211> 373 

<212> DNA 

<213> Homo sapien 



60 



<213> Homo sapien 

i. , - ■ ■ f .. , 

<400> 63 . - j 

acacacgggt tccggatcaa tgctcgggcc aacgccactg cctgtcgctg accccctgac 

agctggctcc cagcctcgtc tacctctgtg tcatagccct gagggagtcc agagatgaaa .120 

ctatgggccc cagactttac tgcagcagct gtgatttcct ccatagttgg cttctgggtc 180 

aggccatagg caatattttc ttgaagactt cttccaaata cctgtggctc ttgtcccact .240 

gcagccacct gcctgtgcag gtagcggtgc tcatattggg gaaggggctt cccatccaac^ 300 

agcagctgtc ccccggtggg ctggt 325 

<210> 64 - : .; ■ ; . : : . ■ <;■■.. . ' '• - . " • ' 

<211> 599* ■ '•• : V- ?:~ *= . ' . ; . >_ . . . .-. .->;■'->:- -■ --r 

<212> DNA -..}-,'■.. r v V - ;'V> ; ^-' , *.. : : ? : r ■> > c - «. • 

<213> Homo sapien- . , p. - • £ : * " : - • ^ - ■ v: 

<220> 

<221> misc_feature 

<222> (1) . . . (599) ■ 
<223> n = A,T,C or G 

<400> 64 

actttgatgt ttgaacaacc ttttcttgat cacttcttcg caataaaaat atgacatatg - 60 

tagtaaacct itaaaaaattt. cgtg.taactt tatggctcta ,cgctggaatt ; cttctgaagt s r. 120 
gagtaatcat' cacaatcatc: tttagtatat raatggatcaa aatgacacga ttgcaaatat 180 
tgataacaca- cagttataaa aggtgaaatt; ctattgggaa cacafctctt agtgagatag > 240 
atggggctga cccaccaatt aattcattta tctggatgaa , tagttcctac tggtagatta 300 
acagggttca ttttcaatrtc tgttgttttc acagatacaa gtgctgagaa atggttttac ■ .- 360 
ataaataggt gagaatgota gtagttttgt , tgtaagcatg tcaatcaatc gtttggtttc 
tttccgagtt gcatgccaaa aaccaaatag tgttccttca tcagctgaca attcatgggc : 
c'accattaat tttgttgaaa - gcaaagaacfc ^ggaaaccatc tgacttgaaa agaatttggt _i ; :.. , 
atcctggtat tagaggcatt cactttctct agngactttt aattatacta attactctc -599 



420 
4&0 

54=0 



■ ■■ i 



<400> 65 

acattaaagt gtgatacttg gttttgaaaa cattcaaaca gtctetgtgg aaatctgaga 60 

gaaattggcg gagagctgcc gtggtgcatt cctcctgtag tgcttcaagc taatgcttca -J ;>:120 

tcctctctaaJ taacttttga tagacagggg ctagtcgcac agacctctgg gaagccctgg 180 

aaaacgctga tgcttgtttg aagatctcaa gcgcagagtc tgcaagttca tcccctcttt 240 

cctgaggtct gttggctgga ggctgcagaa cattggtgat gacatggacc acgccatttg 300 

tggccatgat gtcaggctcg gcaacaggct ccttgttgac actcaccaca ttg.tttttca 360 

agctgacttc cag " ' 1 « '• ' 373 

<210> 66 

<211> 520 

<212> DNA 

<213> Homo sapien 

<400> 66 

a'cgtgagcca gtcatccata^ cactaaggdo tagttgagaa aaacctttga ttcaggatgg - 60 
ctgggttact aaccttgaaa' tgtaagagat' ctggfctttga atgtaaaagt tgcaacacac . , 120 

aaacggaagt. cttaaaaact ttttgctctg 'gtcagttaca ggtggatccc caafaa'tctg 1* 180 
tttttggttt - tctgatggaa ataatagaat taggggaaat caaatctggt tggtaggtgt . ;j- 2.40 

ctacagtatt agaagagggr ataagggcac tgtttaadao -taagttctaa tacttccaga ; 300 
aactgtgcat tccagatcta catactaaat gctcttatca' ttttgaaatg ggctcttgat : 360 
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taatagaccc atatttttta gtggcttcta tgttgtatat ttgtctaaaa tgaaagctct 
tttgcgttct aaaactacaa tatatgtcat cttattttcc ctgagtatcc aagtatagtg 
cagattctat gtaaaactac taaatgacac tggaatatgt 



<210> 67 - 4 

<211> 241 

<212> DNA 

<213> Homo' sapien 



<400> 67 

acagagatgg agaacgaatt tgtcctcatc aagaaggatg tggatgaagc ttacatgaac 

aaggtagagc tggagtctcg cctggaaggg ctgaccgacg agatcaactt cctcaggcag 

ctgtatgaag aggagatccg ggagctgcag tcccagatct cggacacatc tgtggtgcfcg 

tccatggaca acagccgctc cctggacatg gacagcatca ttgctgaggt caaggcacag 



<210> 
<211> 
<212> 
<213> 



68 
487 
DNA 
Homo 



i ■ 



sapien 



<4 00> 68 o - ; ; \. .N; - ... 
act ttgaggg ggtggt e -ttgggccect' -cctggcccag gagatg.tagaj atacgggtgg 
ccagcactgt gaac.tdgcag tcctegatga ■ actcgcaca^f atgtgacagc .cctgtctcct 
tgctctctga^ gttctcttca ratgatgct-ga :tgatgcagtc cacgafcagcg cgcttatact, . 
caaagccadd ctcttcccgc agcatggt-ga : aoaggaagtt cataaggafcgs.-gcgtgtttgcr 
gaggatattli ct'gac?acagg -gcactgatgg ccictggacaafc oaocacettg. aattcatccg 
' agatttctga. "cat-gasgga^ngagatctgat- teatgaggcg „gtcgatgctg citotcgctgc 
/dcgtcttaag gagggtggtg ; atggccagog : tggcaatgot -gcg.gfcttgaa tct^tgacca 
'Jggttctccag atccagatta oaagctfgtca . cagetgacgg atgcttcatgr gcaaccttat 
-'t'gagggt -j j-... -; s o 77 ;M r . - ■ \ 



<210> 
<211> 
<212> 
<213> 



69 
415 
DNA 
Homo 



sapien 



<400> 69 

actagcttca agaagctttt ggtoagctaciatttaaagg.c acaatagggc ctttggattc 

attgg-f tttt ca'ctgagtgg tttggaagta tctaaatcjggL actttttact 
cttactacca catccttggt gccaggagat? ttctcttgtgv atgaqaataa 
ccttgaagat gagatatatc cagaccttct tt.taggcgaa taaccactad 
atgtcaaaag catcatgaaa Jtaagtttat-a tacatatcca catccctcat 
aaccaatctt tcttaaatcc aaggacaagt gtgtttggct tcatacgacc 



tttgtgtgta 
atattccaca 
ttcttcttgt 
tccatattgt 
atctgcttgc 



aagaccagca gcctgcatca aatactgtgc accttctctc aagtcatctg catgt 



<210> 
<211> 
<212> 
<213> 



70 
535 
DNA 
Homo 



sapien 



<400> 70 

acatcatgtc tt;ataaggaa\ gecafctaagg .bcarctccacfc/gccatg.tatg caaotgptgt ? 
gtggctcgafc. atgatcaaca c.tgcctgtgg ;actggacggt: .goataggttt,; tggcaaccat,- 
eactattaca tattcttctt gtttttcctfe ,tccat!ggtat .gtggctggat.,ftatatatgga. 
tctttcatct -atttgtccag : ..tdattgtgQGw^acaacattca ^aagaagatgs ra.tt-atggact; 
tacctcaatc.agattgtggc ctgttcccct , tgggtfctta ; t Latatcttgat- ^ctagcaact 
:ttccatttct cafggtcaac. att'ttt^tta.-ttaaatcaac, .^etttcagat tgcctttctg 



420 
480 
520 



60 
12 0 
1.8.0 
.24 0 
241 



, 60 
•120 
18,0 
24 0 
300 
360 
420 
480 
4 87 



• - r 



60 
120 
180 
240 

. 300 
-^360 

415 



j 60 
120 
180 
240 
300 
360 



v.- 
3. 
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ggcctgacct cccatgagag aatcagcctg cagaagcaga gcaagcatat gaaacagacg .420 
ttgtccctca ggaagacacc atacaatctt ggattcatgc agaacctggc agatttcttt ,48 0 
cagtgtggct gctttggctt ggtgaagccc tgtgtggtag attggacatc acagt 535 

<210> 71 
<211> 249 

<212> DNA , ; . ..." v ■; 1 - t . 

<213> Homo sapien -r. : - ? ■ '> ' " ■ - ; ' ' - " - - 



<400> 71 . ■ « • •-• - : • . • 

agcgggacga ggatgacgag gcctacggga -agccagtcaa atacgacccc- tcptttcgag 
gccccatcaa gaacagaa'gc tgcacagatg itcatctgcfcg.. cgtcctcttc ctgctcttca 
ttctaggtta catcgtggtg gggattgtgg cctggttgta tggagacccc cggcaagtcc 18,0 
tctaccccag gaactctact ggggcctact gtggcatggg ggagaacaaa gataagccgt 240 
atctcctgt 



<210> 72 

<211> 297 

<212> DNA 

<213> Homo sapien 



....> .j 



'■ V'i." 



60 
120 



249 



■< : 400> 72^ •.:.,.«•; r : ! r :-, ,> ; ■: > -> . r. J • - ^ . ! " "•' - : • • • - -* : - " ' 

acacactgat t^tgcggcca igacaacacet atgaggtgaa gattgacaae agccaggtgg - . 60 

agtccggcte cttggaagac gattgggact tcctgecacc caagaagata aaggatcctg 120 

atgcttcaaa accggaagac tgggatgagc gggccaagat cgatgatecc acagactcca : 1&0 

agcctgagga ctgggacaag cccgagcata ; tccccgaccc tgatgotaag aagcccgagg 240 

actgggatga agagatggac ggagagtggg aacccccagt gattcagaac cctgagt 2 97 



<210> 74 
<211> 394 
<212> DNA ... ? 
<213> Homo sapien 

<400> 74 

actaaaactt acaataaata tcagagaagc cgttagtttt tacagcatcg tctgcttaaa 



60 



<210> 73 

<211> 531 ! * 

<212> DNA 

<213> Homo sapien r ; " - ~ - 

<400> 73 

acttgtcccavctcctgttca- gaggtcacat gcttat-ccaa aaactctgcc atcccaatgc 
ccattctccg gcaaatgtcg gcaatcactg -tttggtattt ;;ctcagccaga tttctaaact -r 120 
caagggagat cgttgggaag tcctccagca cctggcgatc cttctccttg ctctccatga ( 180 
accgccagtc tggttggtaa aggaaagagt gaaagttgtg taacagcggg aqettcttt.t ."-240 
ccacactgat ggtcatgtca tcttccagtg tgtccagagc vtcggagaacc ; agataaaata r 300 
tgcacactgc gttgcgcatt tccccatcca gcgcctggat aacagctgcg aaactgcgac 360 
tggtctgatt gagatacttg tagcaagttt tcaggctgct gctgagcgag tcctggtcca '^20 
tcttgggcat caccttccgc ttgcccccga tccggaagcg caccaggttg tagaactctt 
cggggtggcc aaggcatttc acgaactcca tcctggtgca ggcggcggac 



t 531 



60 



agctaagttg accaggtgca taatttccca tcagtctgtc cttgtagtag gcagggcaat 120 

ttctgttttc atgatcggaa tactcaaata tatccaaaca tctttttaaa actttgattt :180 

atagctccta gaaagttatg ttttttaata gtcactctac tctaatcagg cctagctttg , 24 0 

ctcattttgg agcctcacta aaataacaga tttcagtata gccaagttca tcagaaagac 300 

tcaaatggaa tgatttacaa aatagaacac tttaaaccag gtcagtccta tctttttgta - 360 

gctgaaggct atcagtcata acacaatttc gcgt 394 
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<210> 
<211> 
<212> 
<213> 



75 
369 
DNA 
Homo 



sapien 



<400> 75 

acattggtga tcggagtata gttggagcgc tttgtcatga tttccaggtt ggctttgtcc 

acagctatgt tggccaatgc accttgagcc tcaaagctgg caaatcgtcc aaattcttca 

agccgccaga ccgtctcctt ctttgccata tccacatgga aaatctcatc accatcaaag 

tcaaacataa actcgcctga ttggtcagga ttcagataga actcggcctg gatgatcaca. 

tgttcttctt ! tgatagccca tgattcctga gcgctcatca ; gca'cagctat .gatgaaaaat 

cctagcacag ggactccact tatggccatt . ttcttcttgg ■. gcgctctgtt gggagtcagt 
agagctcgg* i j • ■ ~ • ■ . > v . , > . . ; - : ; • - : . 



<210> 
<211> 
<212> 
<213> 



76 
384 
DNA 
Homo 



sapien 



<400> 76 

acgactcggt gctcgccctg tccgcggcct tgcaggccac tcgagcccta atggtggtct 
ccctggtgct gggcttcctg gccatgtttg tggccacgat gggcatgaag tgcacgcgct 
gtgggggaga "<3gacaaagtg aaga'aggcec gtatagcdat gggtggaggc ataattttca 
tcgtggcagg tcttgccgcc ttggtagctt ^gct'Cctggba r tggceatcag attgtcacag 
1 acttttataa 1 ecotttgate cctaccaaica ttaagtatga gtttggcccb gccatcttta 
ttggctgggc agggbct^cd ctagtcatCG ■ tgggaggtge actgctctcc tgttcctgtc 
ctgggaatga "''gagcaaggct gggt ^ ^ ; 



<210> 
<211> 
<212> 
<213> 



77 
291 
DNA 
Homo 



sapien 



<400> 77 t 

acgtggcagc catggctccc tfcc&eaaget gtaggtcctg -gt gggacagc .tggct.ttggg 

gaagcttgtc tttdtgggtg acccatggat < gctgcagaac otgcbtagct gbgagg.cgcb 

^gtggggatc 'cacgtgtagc atcttggaca ccaggtcctt ggotgtctct t .gaaactgtgt 

tccaatttcc ccc&etgagg gtaaacttcc cactgccga* cogggttagg attbcctctg 

gtgtgtcact gggaccgtbg gcaaatggag . tatabcctgc-cagcatggtg *.t 



- ! 



<210> 
<211> 
<212> 
<213> 



78 
242 
DNA 
Homo 



i -1 



sapien 



<400> 78 

acccatattg ctaatgctag gatcaagata ccacatagcc agaacaagaa gttgaaggta 
aacatagaat attttataca ggcactcaca cctgccattt cggaaaagga ttaggaatcc 
agatgccgtg aatttaacta ttcgttacag gcttgtcctg caatatgctc tggagcaact 
tgcctgcaga gatttctgta tccacggaca tttaaatatc gcaaaggcta tctccaggca 



ag 

<210> 
<2ll> 
<212> 
<213> 

<220> 



79 .-. 
449 
DNA 
Homo 1 



sapien 



60 
120 
180 
240 
300 
•3-60 
369 



60 
120 

iiao 

240 
.3.00 
360 
384 



; : 60 
120 

.. 18 0 
240 , 
291 



60 
120 
180 
24 0 
242 



BNSDOCID: <WO 01 96390 A2J_> 



WO 01/96390 P<h7US0i/18577 

23 



<221> misc_feature 
<222> (1) . . . (449) 
<223> n « A/T,C or G 



■' i ! 



<400> 79 

ngtacagaca aaactacaga cttagtctgg tggactggac taattacttg aagganttag 

atagagnatt tgcactgctn- aanagtcact atgagcaaaa, taaaacaaat-aagactcaaa 

ctgctcaaag tgacgggttc-ttggttgtct ctgctgagcai -rcgctgtgtca afcggagatgg 

cctctgctga ctcagatgaa gacccaaggc ataaggttgg, gaaaacacct catttgacct; 

tgccagctga ccttcaaacc ctgcatttga accgaccaac attaagtcca gagagtaaac 

ttgaatggaa taacgacatt ccagaagtta atcatttgaa ttctgaacac tggagaaaaa 
ccgaaaaatg gacggggcat gaagagacta, ateatctgga aaccgatttc agnggcgatg 

gcatgacaga gptagagctc : ggnccagcc » ; . J„,_ , : : , r 



.60 
120 

180 
?40 

3 00 
360 
420 

4 4:9 



<210> 80 

<211> 490 

<212> DNA 

<213> Homo sapien 



<220> 

<221> mis cofeature 
<222> (1) - . - (490) 

<:Z23> n =;-A>T,C or G i - - ■ . • ■ * \ * ■ ^ - ' ' . -: ; . <\ ; ' ; 

<400> 80 - ..■••■> . ■■ ' - . 

acatttcctt 'gnagactctg ntaatttcct . gcagctccfcg gttggttctg gagcagatga 

tctcaatgag agagtcctcg tcggttccca gccccttcat ggaagctttt agctcanaag^ 

cgtcatactg agcaggtgtc ttcaataggc ccaaaatcac cgtctccagg tggccagata 

aggctgactt cagtgctgat gcaagttcct ttttggtcct tctctggtag gcgaaggcaa 

tatcctgtct ctgtgcattg ctgcggntgg tcaaaatgtt gacaatggtg acctcatcca 

cacctttggt cttgatggct gtttcaatgt tcaaagcatc ccgctcagca tcaaagntag-. 

tataggcttt gacagaccca tatgcacttg ggggtgtaga gtgatcaccc tccaagctga v 

gcttgcacag gatttcgtga acagtagaca ttttgaagga agctgggccg tgcgccgaga 
gctgagagcg 



1 ■ i*- 



<210> 81--— o ■ - * -v,..- p- ■ , ■ ■' 

<211> 339/ . >, ■ -■■ > ' ■ :: - 

<212> DNA *■* / ; - - : ' • ~ " - : — V 

<213> Homo sapien . , ; : r 

<400> 81 • • . ' v : 

acagtagtaa ctgatgtccc cttcttcctg 1 gatgaatgag cagataaata ttgatgtcag 
catccttgaa ccatatcaaa gtgagcagtg tttggctact gcttctattt gaaatggtgc 
tgtgttttgg ttgtggtctg aagctttgaa gcgctactta gcatctcctt tcttccatgg 
agctctcacg attcaaacat gacagatttg gtaaaatgct ggttaggttg agtcttcctt 
gcccccactc agtcatcttt gtatgaatcc catgatttgg gggttttttt cttttttttt 
ataccagttt ttagctggtg tttatgaaga acagtgagt 



<210> 
<211> 
<212> 
<213> 



82 
239 
DNA 
Homo 



sapien 



<400> 82 

caagaacagc taaaatgaaa gccatcattc atcttactct tcttgctctc ctttctgtaa 

acacagccac caaccaaggc aactcagctg atgctgtagc aaccacagaa actgcgacta 

gtggtcctac agtagctgca gctgatacca ctgaaactaar. .tttccctgaa- actgctagca^ 

ccacagcaaa tacaccttct ttcccaacag ctacttcacc tgctcccccc ataattagt 



60 
120 
180 
240 
300 
\330 
420 
480 
490 



:60 
120 
180 
240 
300 
339 



60 
120 
180 
239 
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<210> 83 , 

<211> 528 

<212> DNA 

<213> Homo sapien 

<400> 83 - T : . . . :■.,-.:>..•.• 

1 acattcgtta ttttaaatga-'-aca'agt-ttac' aaagtttatt ttcatctata ! .cgtaa£gatg . f 60 
-atttttttaa aacfcttttac : atattagtgg ttatgatcca atgf gtbattf* agtgaattta • ' 120 
actgtaaggt ggtttaaate aaatatgcaa tgttta'cttg- aattgtattt ctattagcag ' - 180 
attttgacta • t ; gtttacagg acggtttaaa ttaaggatta: tcaggtiatgt ; gSgatctttC - 240 
' agttatct-tt" aaagtagatg "tatattaagg. gcttagattt aggat^ctaca tattctgggc ; 300 
attgaatagg cagtaactta caaataagtt ttgcttacct t.ttgtt'ctag ggactagcac ; 360 
ligctatcaat ggaaagtatt tttaactaat ctgttattaa gaaagfrcata' 5 tfcfcttgcatt -420 
tcagccaaaa taaagaccgc ctgtaataat ctgttagaaa cagataatac atgtctgaaa 480 
tccatatgtt tcatatgatc taaactgtat tttccaattt aaattaaa 528 



'V 



<210> 84 

<211> 249 

<212> DNA 

<213> Homo sapien , * : 

<400> 84 .<.";:.■£>; \ . , ; : 

acactgaagc agaaccggaa acacccagga actgttcaga aatctcagaa gaaatctgct - ' 60 
tctcttcgat ggaaagatat aattaacgat caaagagctc taagaaaatt gcaaagaagc 120 
cttaatgttc aagctttaga aagatcagag caatttttct ctttcagtcc aaactaagac - 180 
tctctgtatt taaatetctc - tggggbaaga gggctagatt tcctcatttt; £ttatgagaa ; 240 
fcagattggt'- ; .> j 249 

: * •". '- : > r:'> : -> -'\ L . ;c • - = ■ „■■:■.':■>.'.'■<■ -'■ / , 

•'<2l0> 85- .- -v" . ^ V ~ ' : ?0 . .>'.::<,.;•» • ' - • • - , '.' > 

0<211> 4 96 - - „• 1 .j ~> . i ; .', ;. '.Z'i' :•: ; , ; .... . ., ,' ' \ .. 

0c212> DUA - ; v... v.:/ 
''K 2 1 3 > Homo "sapien : 



• u0 7..' ; pj y ^» --f £j type p f: . " 7 . :• i.". ' - I o' 'f'Or' •• .:.v . ;. 

*400> 85 - . ^ - - 

actggccctc ggtgctggca aaggtgtagt tccactggcc gagggaatca agacatagtg 60 

gtccttctgc taagecaagg gctgccacaa tgacacagta gccagatcct gcaattccaa .120 

tgagagcagc caatacagaa gaaaacatcg cacatcgttt gccacagttt tcatggccac : :180 

agcagccaca gcagtcatcc tgttccagcc caatgaagac aaatgctggc aggagcatca 240 
gcaggccacc tcctacgatg ccagaaaaga accacacgaa geggctgagg tggttttcgg H -300 

aggcatactt tgtttcccca ttgggaaagt aaagcaaaat attaaccgeg atgcacagga 360 

gggcgagccc caccagagaa tgtccgatgc ategtgeaca cttcccatag cacatggtgg 420 
tetgetaggt tttctccccc ttctctttgt cttcagctpa gtgatacccc aaattagatg . 480 

aaagtgtgcc cttctg 1 , ; j .* ■■" l ■ 496 

<210> 86 • ' • •• . . - " ■ . ■ *. . 

<211> 199 . - v , 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1)...(199) --j" 1 *' 
<223> n = A, T/ C or G 

' * - 0 C '• - : 

<400> 86. • . .v.:.. . r _ r~, . ;.t.;^ . i ,j : • , ; . ,: 4 - . • : = ^ r : 

acagaaagag taagataaaa acat.ttaata tna-ttaaatG, ; taatttgcaa: aaattggtat „ 60 

clgacatttg ttgtgtgctci.ttgcaaagag.i cgcataggac^att ; tctgcag' caatcaaaaa. 120 
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ggtaaaatct ttttaaactc agatttcaag tttcctctaa tattccttct aa-tcctantc : 
cctggaaata ctttcaagc 



<210> 
<211> 
<212> 
<213> 



87 
436 
DNA 
Homo 



sapien 



180 
199 



1 t 



<400> 87 

aacgttttga tttcatgaag gtgttctcaa atttaaagca cattttcagt aagaacaaaa 

atatttaatg tttttatctt agacttaact tgatacattt gcatattact atggaagtta 

ttcaccttgt ccctgttttt ctttaagata ttttaaaatc atagttatac tacagtcctt 

ttttaaatgt atcctgatac attgtaaaat attttaattt cattgtggaa aataatgttg 

gataaggaga tatttttcac tgttaacttt tagcccatgc attttcataa tttafcttttt- 

tcacttgctg ctttatatga catatgtgac atttgattat ttaacacttg atgtgatctg 

cataaaccca agttgcacaa ccctcctgct gaagataaaa ttgaggttaa agataaagat 

ttattttcat atttgt ' 



r i 



<210> 88 - 

<211> 596 '- r 

<212> DNA . -■■ ■ 

<213> Homo sapien" 



(. -r:i. 



J o 



60 
120 
.180 

24:0 
300 
360 
o420 
.436 



' r 



<220> 

<221> misc_f eature 
<222> (1) . . . (596) 
<223> n = A, T, C or G 

<400> 88 

acaaaagctg gtaatggacc aaagacttcc aaaatatatg tgtaatgacc tccagatttc r ■< 
tttatagttg ttcecaattc^ agcataagac aaagctccaa -atagtgacag-gaccccacac , 
acegtccaga tggt cagaga - eatgcccacg- ct gcapgtgtp: -irctggagcac gccctt aggac .?j 
gagatgaaga ttcctgctcc aatgatggtg ccaatgataap t^gagact cc ■ cetcagtaaa ; : 
gtgactttcc tcttcagctg cactttctcc tgcccaggtg^ctccfctgtt gcccagggaa - 
ggcagcctcc cgttaacatt tccctgcagg taacctcctt tggagatggt ggacacaaca 
ggctttctga ccatagtagg gacacacggg ggaaaaataa aacagaggga aagaaaacaa 
aactttcaac tttggtgtct cttggtgtta ctgatcgatg tcttcctctg ctttcagact 
gtctctctca gcgctatagt gttcacaggt gaaaactcaa aggtgtgctt tttncttcac 
agcgatctaa ttactactca gaaacacctg tgtatgcatc gtgctctcaa ttcttc r 



<210> 
<211> 
<212> 
<213> 



89 
435 
DNA 
Homo 



sapien 



<400> 89 

acacaagtca gtccaacagt tagtgttaat tactaataat atatgaaaac cctgccaaca 
caattgctgc tacatcacca atataattat taaccactgt cggaaaaaca cacataaatt 
caggtaagac taaaagctgt ctcacaaaaa gaaaaaagaa atccaatgga tccactaatg 
ctatcaaaag ggacatgcag gaatgtaaca tgacattttt agaaatgtgt gtttctaaaa 
agaaaaaaaa atacactaaa atgccagtgg actataattc attcaaaaca tctttagtgt 
tccttcccaa agatcttgat ctgctcagta attgcttcac aagatctatc acagccatct 
tttggagcgt atggttaggc tggtcctcct gtggtggtag gggcagtctt tttgaagctt 
taagtatctg gt'ggt 



<210> 90 
<211> 344' 
<212> DNA 



UJ60 



.180 
.240 
-300 
360 
420 
480 
\540 
596 



60 
120 
180 
240 
300 
360 
420 
435 
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<213> Homo sapien \ , ; . . 

<400> 90 

actcagcgcc agcatcgccc cacttgattt tggagggatc tcgctcctgg aagatggtga 
tgggatttcc attgatgaca agcttcccgt tctcagcctt gacggtgcca tggaatttgc 
catgggtgga atcatattgg aacatgtaaa ccatgtagtt gaggtcaatg aaggggtcat 
tgatggcaac aatatccact ttaccagagt taaaagcagc cctggtgacc ag.gcgeccaa 
tacgaccaaa tccgttgact ccgaccttca ccttccccat ggtgtctgag cgatgtggct 
cggctggcga cgcaaaagaa gatgcggctg actgtcgaac agga 



<210> 91 
<211> 371 

<212> DNA 'f " - .■ 

< 2 1 3 > Homo s apien 



; i 



SO 



agcaatgcaa aggacatctc caatcatgac atttaagaca attctttatt tQ^Qtgapag, 

tgacttcttg aagtgcacat ataataaata aatagaaaat atatctttgt tcatggtgat 

gcctacaaga aatgtttaca tacaaacact ctatacatct aactcccgaa aaaggaccag 

ctatttcggc aacagaaaaa agacaagcat ttcagaggag cgttgctttc cttaaagacc 

taactcactt aagtcttaca aacagaaata acaaggagga caattttcta agcaataaga 

aaatttgtgc taccaagaaa atgcctagat attggctctt ggtgaatggt ttaggaaaga^ 
aacttttatg t 



60 
120 
180. 
240 
300 
344 



60 
120 
180 
240, 
300 
3-60 
371 



<210> 
<211> 
<212> 
<213> 



92 
209 
DNA 
Homo 



sapien 



j 



<220> >:■ . i :■• 

<221> misc_feature . . 
<222> (1:) - . . (494) , 
<C223> n = A,T,C or G 




<400> 92: j - . ^ - ^ ; .., ... 

v acaacaaaag >atoaaapcca tgtceegafcg ttaacttttt- a.apitaaaag aafegccagaa .. : ■[ 60/ 
■ aacccagat.Gi; aactactttcc: agctacgagcj ^g.t^a^aaaogg,e^acecaa faggeeagtca : . - ; 120 
; gaetcgtgca ;gatcttattt.: fcfetaa£ag&a: 

j* ttcatattct r fccccccatiia jaacaccagfc^p.^-oo*- o:: *• i ;.• ci .:• r •* o;:dc fv; •* ni- 

<210> 93 - 1= .- -ft: ■ c ( ^- ■ ■ :- r: „■-■ - C 

<211> 176' ,- ' • • Vf • ■ .'* 

<212> DNA .... '• - j : - .,..*-:• ' ^ .'. 

<213> Homo sapien * >. (■ • , - r ( , . , - . ... -~ ., 

<400> 93 

actccctgtt ttgagaaact ttcttgaaga acaccatagc atgctggttg tagttggtgc <60 

tcaccactcg gacgaggtaa ctcgttaatc cagggtaact cttaatgttg cccagcgtga y. 120. 

actcgccggg ctggcaacct ggaacaaaag tcctgatcca gtagtcacac ttcttt , - 176 

<210> 94 

<211> 494 : . . .... ;!*■■•-•• 

<212> DNA ' • \, 

<213> Homo sapien f . - 



i - A 1 



<400> 94 

aaatggaaat ttaantgaca tcctanaggt agagaaaccg nggagatcnc ttttctcaga 
ctcaccaact tttaatggga tttcatgggg tttggttgtg ctgatagggt aaggggaggc . : 



60 
120 
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tgctttctgc 
atttggaaag 
gaaaattcta 
tttatttatt 
annatcacag 
ctatatgatt 
tacttcagng 



ccttctcccc 
gaccaaattg 
tggaatagtt 
tattacccta 
atgtaaatta 
tagcaagttc 
ctca 



actcccatct 
ctttacagtt 
ctgtatatag 
agaatgcttt 
nctcaccana 
tagaaggncfc 



gatttactta 
tttttctttg 
ggcacaagta 
gccataacca 
tttactgngc 
ccaagacaat 



attcagtctc 
cgtagtatct 
aaggcattgt 
catttaatgg 
ctgaactcat 
aattacattg 



agctgctgaa 
tgaaatcctg 
ccaaagttta 
gaaaaacggc 
tctcttcttg 
gcacaatgta 



180 
240 
300 
360 
420 
480 
494 



<210> 
<211> 
<212> 
<213> 



95 
260 
DNA 
Homo 



sapien 



<400> 95 

cgcggcgag^ tacgggcttt ceatctagtt gccagcttag atctg^ggtt- ggtaacccac 
tgactttgca gtccaitctg cagagttttc cttcttgaac agtcagatct ccaggagcct 
gcaagaagtg aggtct^aag aatcgctcct gaattggtt c attttcgtct ccactgtccc 
tltgatctaga acgaggcctt ctgacatgag gatggcctga gggagaccgg ~ 
ctctttggtt gacagcctgt ( ' ' 



60 
120 
180 
240 
260 



<210> 
<211> 
<212> 
<213> 



96 
438 
DNA 
Homo 



sapien 



<400> 96 

accagttctt 

tttaaaaatc 

aaaccccctg 

tgctgtgctc 

actaagc'ccc 

gcacattcct 

aaaagaagaa 

ct^aggaatt 



gtttatatac 
taatttcttt 
acagatctct 
ctgtggcact 
tcatgggcag 
cctacatggt 
aatcacaatt 
aaggig^t 



agtagtgttt 

atttgtgtgg 

ggatgatcct' 

aaadtccttt 

gtaatgaaga 

agatgtgcaa 

tdttafccaaor 



tgggcacacc 
ccttctagac 
bcttgaatcc 
tgattggttc' 
ttga'aaactt 
tag a tg t tilt ' 
ttrgta^c&g 



taaggtcgat 
aaacgaaggg 
i:gggcagttt 
tttctttcct" 
ttttctgttc 
taaaattgga 
aca ; 



<210> 
<211> 
<212> 
<213> 



97 
454 
DNA 
Homo 



ctgtgttgta 

gacccagagg 

ggtctctcct 

tcccagctag 

tccag;tgt-ga ( 

g aa t g a a a a t r 

caattgcatt 



' : 60 
120 

mo 

240 
360 

;*20 

438 



sapien 



<400> 97 

gagtaattcc 

gccttgagct 

atttcacggc 

tttattttgg 

tagctaccac 

tttgatcgat 

tccccctgct 

agttattata 



cctccagcac 
gggttgattg 
aatgagcctt 
gctcagaaat 
atcattttga 
atattairggg 
tttaggatta 
caatgtagtg 



tagagaccgc 
aggatgtgtg 
tttctttcta 
aattgctctg 
aaggaccatg 
tagctcttlga 
aaatcaaaga 
agtcctgcat 



t ca g t gc t c t 
aaaagctcac 
cactejaagat 
ttgaaaataa 
agcaactata 
ttttgttaac 
tacttctata 
gggt 



<210> 
<211> 
<212> 
<213> 



98 
226 
DNA 
Homo 



tactagatga 
a gag etc gat 
tttcttctta 
tcctttgtca 
ageaaageca 
attaagataa 
tt'tttatcac 



actcagtaac 60 

gcctgctgct 120 

tttaatgtgg ' 180 

gaaaagaagg 240 

taagaagtgg 300 

ggtgactttt 360 

tatagatcat 420 

> 454 



sapien 



<220> 
<221> 
<222> 



misc 
(!) 



feature 
\ : (226) ! 
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<223> n = .A,T,C ; or G 



<400> 98 

actaaatggt ggtctaggag cagctgggcg. natagcaccg ggcatatttt ggaatggatg 
aggtctggca ccctgagcag tpcagcgagg acttggtctt agttgagcaa s fcttggctagg 
aggatagtat gcagcacggt tcfcgagtctg tgggatagct t gccatgaagt aacctgaagg 
raggtgctggc tggtaggggt tgattacagg gttgggaaca gctcgt 



60 
120 
180 
226 



<210> 
<211> 
<212> 
<213> 



99 
333 
DNA 
Homo 



sapien 



<400> 99 

actcatctag 

gaatatcttc 

tgtgaaggtt. 

tcaggtgaag- 

agtatacttt 

caacatgatc 



a c g 1 1 1 a g g t . t a tlbfc 1 1 cgtgr- 
t ggaata t a c .Jie^t.c tggaa 
1 1 £ tggc a g c ; ,a t j^ocat aaa, 
ctgajgcagpt gtt^c^tgtc 
ttgttcccat ttgtttccag 
cccacagtca ctcagtgcat 



g tt ga gg a a g c t c c t c.t a c t 
.ajaagajtgcaai cctttccatc 
" t ag e t ag 1 1 1 ^ ta c a att ge c 
jt aa ajt cc ape t ggg t ag c c a . 
aaaatgcaat gtgtcttgtg 
ggt 



aaattcttaa 
attg.t tpttc 
actg<gatccc 
gtatgcgaag 
tteatt.ataa. 



. 60 
120 
18.6 
240 
30.0 
333 



<210> 
<211> 
<212> 
<213> 



100 
417 
DNA 
Homo 



sapien 



<400> 1Q0 

— i 

, aecgccacat 
, atgtcatcaa 
j^cecagtegg 
ra;Qctgaagaa 
rgpgggtttgc- 
ttgggaaagc., 
•tQttcaggtc 



egctgaett;^ ^ctggpaact ctgaagtcat. cctgccagtc ccggcgttca 
t gg c g g 1 1 c t , c a t g c tig g ca , ac a ag c t g gc c a t g ca g ga g 1 1 ca t g a t c c 
t g ca g pa aa c . 1 1 c a gg g a ag c ca ( t g cgc a t . t g g a g c a g a g g 1 1 1 a c c a ca 
tg.tcateaag ^agajaata-tg Iggaaagatgc. caccaatgtg ggggatgaag. 
tccc.aap a tp, c.tjg^ga^ga a£ a a a ga a gg cc t . g g a$f c tg clb g ' a a ga c t g c t a ^ 
t gg p ta c act- : &at aajgg tg.g ■. . .teat eg g c a t A . g g a cgt a g eft \ g c c tee gag t 
tgggaagtat gacctggact tcaagtctcc cgatgacccc!" ^gc.aggt V'. 



<210> 
<211> 
<212> 
<213> 



101 
438 
DNA 
Homo 



60 
120 
18*0 
"240 
V 300 

"■ 4177 V 



sapien 



<400> 101 

acatatgttt 

aactctgggg 

atagttcact 

cgttgcctgg 

efcegagtaac 

^ettteggta 

caagctcaag 

tccctgagga 



tttaagtaag 

aacagggegg. 

gaaggatgtg 

tctcaccacc 

atctgtcaga 

actttcactc 

aatatctccc 

atagatgt 



ttacttttac 
tetgecttaa 
atgaggctgt 
acagctacgt 
aacaaaatgt 
tctactttgt 
tccg;tagaat 



cattagaata 

caacccttct 

agtaagtctt" 

gcaca.tctgc, 

tgttggttga^ 

gtccaatcttj 

geccgaataa 



aacctagaca 
cVaggfetgag 
ctcatcatct 
ttcctcagca 
gcaccpaatg 
ggita^caaag 
cagtttctgt ; 



ctacagggac 
gaaggcaggt 
gttaatcctg 
gcacfcggcct 
ctgtctgcaa 
tgaccatcaa 
gcctccacac" 



60 
120 
180 
240 
300 
360 
420 
438 



<210> 
<211> 
<212> 
<213> 



102 
466 
DNA 
Homo 



sapien 



<400> 102 

acttaaaaag tggtttttct atcttcaaag tgctaaagaa acaagtattc aaaaagaaac 
ttcaggtegg tctacgaagt tctgactgac ttgaagtagt gaaataccaa gaatgcagtg 



. 60 
120 
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<210> 103 ' r ^ 3 
<211> 500 u 
<212> DNA •: ' 
<213> Homo saplen 

<220> 

<221> misc__f eature 
<222> (1) . . . (500) 
<223> n - A, T,C or 

<400> 103 



<210> 104 
<211> 422 
<212> DNA 
<213> Homo sapien 



J (■:-£.■ j: 



<210> 105 
<211> 326 
<212> DNA 
<213> Homo sapien 



acgaagtagg tccaaagttg ttgaccgtat ttacagtctc tacaaactta *agctcataa 

acataaaatg aatactgaaa gaatacttta caagcaaaag aagaattctt ctataagcat 

tccttttatc ccagaaacac ctgtaaggac cagaatagtt tcaagactta agccagattg 

gqttttgaga agagataaca tggaagaaat cacaaatccc ctgcaagcta ttcaaatggt. 

gatggeScg cttggcattc cltattagta aaWtaaaoa' ttttcagtat gtatagtgta 

aagaaatatt^'aaagccaatc atgagt ' • . . 



k210> 106 
<211> 543 



180 
240 



gacaaattta aaaggccttc attagaataa agtatatctt aactacattt tgcaaagaaa 

tgaagcaatg gttgcacaac cagtcagggc caagttagta acatacaact cagccatcag 240 

SSaLtctc cctcaaacta aactaatcta aatgtatttt tcagaaaatt tcctccatac 300 

Jccatgtatg tgttacatac atccaatcat atccatattt tggatcattt :ttfetctatat 360 

tcatcagatt attggttaaa atgcacagca agtagaaatg atccatttca aaattcttaa -.420 
tatctagcgt tctctgtaaa acaaaagctg acaacagttt tattgt 



466 



nggtgcagcg gagacagagg cggaagctgc agccctagag gtcctggctg ^^ggcagg 
catcttggaa cctgtaggcc tgcaggagga ggcagaactg tcagccaaga tcctggttga, 120 
gtSgtg^ gactctcaga agaa.gacaa gctgctctgc agocagcttc aggtagcgga, 180 



180 
240 



fecctgca^ aacatcctgg ctcaggagga cactgctaag ggtctegacc ccttggcttc 
tgaagaLtg agccgacaga aggcaattgc agctaaggaa caatggaaag gg<=tgaaggc . = 300 
ccccLcagg 1 gagcaegt'ag aggccateaa aattggccto fccaaggccc. tgactcagat , „ 360 



ggaggaagcc cagaggaaac ggacacaact ccgggaagcc tttgagcagc tccaggccaa «u 
gaaacaaatg gccatggaga aacgcanagc agtccanaac cagtggcagc tacaacagga 4 80 

gaagcatctg cagcatctgg 



<400> 10* ' ;.; .. ntnnnnaaa 60 



120 
180 



tqqttctagg adatatcalt aecaaaccaa . agaaagaaaa tattatagcf . tttgaggaaa 
tcltgaa^-tgfatggctc aatgatttcc' tgaagatgat aaagagcaag |tattgcaga 
taaaatgaaa gaagatgaaci catggcgaat aacagataat gagcttgaac tttataagac 
caagacatac cggcagatfca ggttaaatga gttattaaag gaacattcaa S^^taa ; 240 
tattattgtc atgagtctcc cagttgcaog aaaaggtgct gtgtctagtg ctctctacat 300 
ggcatggJta gaJgltctat btaaggacct accaccaatc ctcctagttc gtgggaatca,; 360 
tcagagtgtc cttaccttct attcataaat gttctataca gtggacagcc ctccagaatg 420 
gt 



60 

120 
180 
240 
300 
326 
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<212> DMA 

<213> Homorsapieri - ,a *■ , , 

, ' " - ' • ■■ ;■. - • .: ' • / V ' > 

<400> 106 - • ■■ , / ' '.'!-7^/:.'. 

acttgtaatt agcacttggt gaaagctgga -aggaagata.a; ataacactaa? actatgctat 60 

ttgatttttc ttcttgaaag agtaaggttt acctgttaca ttttcaagtt aa^tcatgta ,120 

aaaaatgata gtgattttga tgtaatttat ctQttgtttg -aatctgtcat, ^caaaggcca : ISO 

ataatttaag ttgctatcag ctgatattag tagctttgca accctgatag agtaaataaa 2 40 

ttttatgggt gggtgccaaa tactgctgtg aatctatttg tatagtatcc atgaatgaat : 300 

ttatggaaat agatatttgt gcagctcaat ttatgcagag attaaatgac atcataataq ; 360 

tggatgaaaa cttgcataga attctgatta aatagtgggt ctgtttcaca tgtgcagttt: 42 0 

gaagtattta aataaccact cctttcacag tttattttct tctcaagcgt tttcaagat.q >. .480 

tagcatgtgg attttaaaag atttgccctc attaacaaga ataacattta aaggagattg 540 

ttt 543 



. f; ( 



C t 



<210> 107 
<211> 244 
<212> DNA 
<213> Homo sapien 

"<400> 107 i . I 

■J- acaaaaatgg vttataaaat^ gttgaagcaa- ctagaagqgte ^gacaggtat^- -aitacatataa . , - 60 
- atacaaccaa <aattcaattci aatgcaaagt ; tgaatgaeal; .c^tafctgcac paaaatttat" , . "120 
tccatacaaa agcacatgca ^bcaagagttt i tcataagatg-aaaacaaaca,. cacttacttc V. .-. :180 
atagcatctt ^ accacttact; tacacaaata --gccQataaae^accatctggG ^attgtgattg ■- -24,0 
cagt ••• - -•- ' ^jJ: : ::■ • -> -- '■ .. •/ . . 2 4 4. 

<210> 108 . .. \. ! " 

<211> 511 

<212> DNA 

<213> Homo sapien 

<400> 108 . ,^ ... 

acttcatgtg atttgtcaac catagtttat cagagattat ggacttaatt gattggtata 60- 

ttagtgacat caacttgaca caagattaga caaaaaattc cttacaaaaa tactgtgta^ar ,120 
ctatttctca aacttgtggg atttttcaaa agctcagtat atgaatcatc - atactgtt;tg ** . "...180,v 

aaattgctaa tgacagagta agtaacacta atattggtca ttgatcttcg, ttcatgaatt .2 40:"' 

agtctacaga aaaaaaatgt, tctgtaaaat tagtctgttg , aaaatgtttt .ccaaacaatg 300- 

ttactttgaa aattgagttt : atgtttgacc taaatgggct aaaattacat t^gataaact v . 360. 

aaaattctgt ccgtgtaact ataaattttg tgaatgcatt . ttcctggtgt .^tgaaaaaga : 420 

agggggggag aattccaggt:. gcctjtaatat; aaagtttgaa gcttcatcca -ccaaagttaa 4 80 

* atagagctat^ ttaaaaatgc actttatttg t r. r ■ 511 

<210> 109 
<211> 652 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc__f eature 

■e<222> (1.) . .'.:(-.652i t, : , .... 

». ^223> n f A,T,,C or= G , . • : , f.^ " ; .. ; r ' l h "'. 

<400> 109 ' • " j ■:. ' o -jj... ■■ ■ . ' ' ' ; \ 

acaccccaaa* ctctc.cct.tg: .ggagcctcaa.r tg^gcagtata. tgtggctcct ggaggaactt ... '.. ' \ 60 
ggtagccctc agtatacaac ttaaagtgat gaatcagtga ctccatggaa gtcttcatct* V X?0 
ctgctcgctt aggtggagac actttggcat catcaacctt gatctcccca ggaggcatct ' ; 'i80 
tgtttagaca ctgtgcgata attctcaggg actggcgcat ctcctccacc cggcacaggt 240 
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300 



<400> 110 

acacattgag tattccacag atatacatgg tttaatatgt ggtatccatg gggtatgatt 
ctaccacagc cttgtaagtg ctccaaacct taaagt 



acctatcata gcagtcccct cgagaaccaa caggaacatc aaactcaaccj tggtcgtaaa 

eatcataggg ctgggtcttc cgcaggtccc actggatgcc tgagccccga agcatcactc 3£0 

cactaaaacc atagttaagt gcttcttctg ctgttacaac cccaatgtca attgtccgat 420 

ttcgccagat cctattgttg gtcagcaact cctccaactc atcaagccga agagagaagt ^ 480 

tcttagaaaa ctgataaatg tcatccataa gcccaagggg taggtcctgg tgcactcctc- 540 

ctggccggat ataagcagca tgcattcggg ctncagacac ttcgctcgta gaactcaaac . . 600 

atcttctncc tttcttcaaa cagccagaag aaaggggtca tgggcccaag gt < 652 

<210> 110 

<211> 96 > . - , , .u , . ■..;.,__..■.>.... - • ■. 

<212> DNA ; • :.. ■ ... , •: -. 1 . >. - 

<213> Homo saplen >>>..;:. ■ - . ■ . r : " 



60 
96 



60 



<210> 111 
<211> 371 
<212> DNA 
<213> Homo sapien 

■ ) \. .* i 

<4G0> 111 ,, ■ ■■ . . -i : * r ; .j ; . . : o ■ . : J - :.■ .• 

acatagcagc ttcataaqag : tttacttttt; taatataaag. atttttcaat : -tt-acacttgt 
aggagtagaa aaaactaata tgctaagtct gtaagctacg cagcaaaaat aatgatctta 120 
atgaagccag- aattctgtga; aaatgtgcac ■cacactgcat ata£ag.tagc ; tgagtaaatg . ,18Q 
taaaccatgt gcttattaac tcttctatat aaaatattga acccccaagt ctcacacatt .- , ;24Q 
gcctcctatg tccacatcac ttttctgaag acagcctcat gctttaagcc aatatatatt 300 
tgctatttga aaaagttctc atcctcatta ctaaaaatgt . ttctgtaaag gccttagaca 360 
tttttttcag t ' c371 

'x t <2 I. 

<210> 112 •>: * "{ - - ^ ' ! ■ 

<211> 406 
<212> DNA 

<213> Homo h sapiens riss.- * v\\»' > : ; i •. ~- . ! .f " - 1 - : ' 



60 



<400> 112 -v. ; ■ • . j - *' >' \, ' 

caggtacagt aatacacggc tgtgtcctcg. gttttcaggc tgctcatttg cagaaacaac 

gtgtcttctgraatcatctct tgagatggtg aatctgcctt gcacgggtgc agcgtagtct: 120 

gttgtcccac catcagttgt gcttttaata, cggccaaccc ac£ccagccc cttccctgga. 180 

gcctggcgga cccagctcat ccaggcgtca. ctgaaagtga. atccagaggc tgcacaggag? 240 

agtgttaggg accccccagg -ctttactaag- cctcccccag actccapcag ctgcacctca -300 

cactggacac catttaaaat agcagcaagg aaaatccagc tcagcccaaa ctccatggtg 360 

agtcctctgt gttcagtcct gatcactgaa tgaaaacact tgggaa 406 

<210> 113 
<211> 492 

<212> DNA - 
<213> Homo sapien 

<4 : 00> 113 \. : \ : , : .." : z.-. .-. * ' .!.,::.' . - ' ' ■ ■ ■ - 

accatcccca gaagtgtctg gtgccaggca. ctgatccagc agctcttcca caatggatga 60 

caataaccga • agctccccat ±ttcatcacg ctggct^atc tttgattgaa tgaaatctac 120 

aacttcctgg ctgctcatea cattceagat gccatcacag gcaatgacca ; tgaattcatg .; 180 

gtcgtcagtg agagtcagca ccttgatgtc, aggaagggct gaaatcatct gttcctcagg 24Q 

tggcaggttc ttgt.ttctct tgtagaagtg; gtccccaatg. gctctggaga .ggttgaggcc 300 

cccgttgact cgcccatcca tggtgacctt ; gccaccagca ttcttgatgc-. gtgctagttc. 360 

tacttcatcc tctggtttgt gatcatagga catgtctaaa gctttgccag cctcagatac • 420 



r 



.1 
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cacacagcga" gagtctcctg cgttggctac- aatcaactgc ttctctcgta tcagggccac 480 

daccgctgtt' gt - ■ ' ... v; . . . 492 

<210> 114 ^ ' ^ ■ 3. . s 

<211> 234"-- ■ r -J- ■: ; : * •= ? r . 

<212> DNA 5 - ■ '* ' - ; . 

<213> Homo sapien - . . j : 1 

<400> 114 

acctcagtgc aaaagttagt tgaactggtt cattcatctc tatggtaaca gcttcctcct -60 

ctttatcgac attactt^tc tgtgacaatt taatgtttcc atttccaagt tctccacttg 120 

cagaaaattt cactccjdftct tttgcacagg aaattacaac agcatctcca atatggctga -180 

gatctcggca tatac^tgca aattcaccag aaggcatctt tactacacag ctgt 234 

f • \ v. 

<210> 115 '■' ^"l >v .-. ■ - .. ~ .. i .. ;, r ■ ;f ; : . ~i •. f : . - . •• ' 
<2ll> 368 J\:>> ^ ■ ... -.^7 . 
<212> DNA 

<213> Homo sapien ; is. 

<400> 115 r.:c c .." 

cctggggtgg gatcagagga tctggcgtgg catcccgtag ccagtcatgc ct'gcctgaga : . SO 

\ cgccccgcgg ttggtgccca tctgtaaccc gatcacgttc ttgccctctt gcagctggtt 120 

atccgagaag ttccgaggat tctccttgga tttcttaggg aaccagttgg gatccccaga 1 .1 1*80 
gaagagccca toatctcggg. cctactgccag^ GccaCccaga-' ttcatcagcg^ ^ccgetgcac ■: ;:■ ~ : . 24 0 

acaggccatg ttctttcctt ccda^aggt^^ ca'cafcft'ttgg aagatgtcag t^gtgttaat - T c 300 

gccatagcgb^ tcagetgctt- "gcaggaact^g -> agagatfctgc tccatfctcfct ■ 'tgaaggccat - -360 v * 
ggtggagg - " .< ; - v; j ■ . - - & . '- . .-j , -. ..-v ;3 68 

" ; •' . - "i-"'., ■■ .-*:'■ /.U": : ■ - c v.1 ■ ; .3 ' ■ - , . * . 

<210> 116- — ■? : ■■ - r • V ■ -: ? v . £. - : 
<211> 487 
<212> DNA 

<213> Homo sapien ' '. 1. i : 

f : y '< 1 v 

<400> 116 AVIG 

ggatttttta ttgtgttttc cacatagata aaaaaataag gctttttgat gaaaagaatc *.: H r 60 

cattacaaag tcaaaaatcc attacaatta taattgaatc agtaacaaaa tttagcttta . 120 

aatgagtcaa gtattctgca tttgaaattt aatatcacaa acattcaaga ttagtgaatt - - X80 / 

ttggtaagaa^ aaaaatadta gaagaaagga aaaggacacc ttttcaacag atagtaattt . ;24.0 

ataaaaattt ttttaaaagt' gctttgggaa a:acacacagt; atcattactt aagaaaagtc J \ . 300 

atttaaggaa gacttaag4:g cttcaagtgg': agtgtattac: jagactaaaaa; atgtttfcaaa ^y- i 360 

atttgccaag aaattt'aagt gttaaaaata ctnttctcGtj%tattcagttt catgtttaag 420> 

gaaacatttg acagacaagt aaaccaaacg. caaaaaaaag ttcacttgea :■ ttttaaacta ;1480 

ataaatt u o. : •>. :l 487 

" 1 ■ ■ t t ■■ ■ * 

<210> 117 

<211> 430 . .. 
<212> DNA 

<213> Homo sapien . i ■ ; 
<400> 117 

gttttacttg ttgatttttg gatgcatgct gggggaggaa agcatattgt ttgtagtcac . ;.<60 

cctagagtgc taaggtatat tattccccag ■ fcaattctctc. ; aaggtfcf^rgcao tatgcaaaac : s 120. 

ataatctcta. aattcttcaa tactaagaaa-tacctttgtt ttacccrctaa aatcaaatgc . : 18:0 

cattttggct ggafcatagga f ttctaggatt : aaagccjtttt tccagcagaa ctttgaagac .24 0 

attgctccat ttacttctag catccagtgt g ; tccagtgat aagtctgctg ;» tcaacctgat 30.0 

tcttgttcct tggtaggtaa tttctcttctbCtctctagaa gcccttatta ttttctcttt • .3.60 

atcactagaa ttccaaaatt tcaccaagat gtg.tctagga.rgtcagtctct ■ tttcatcaat . ; ; 420 

tttactaggt . - ^ ^ r s. : ■ ■. . . i * , : • ■ . * ; > -. ... - ; - c • 1 ■ ■ ; ^ -430 
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<210> 118 
<211> 305 
<212> DNA 

<213> Homo s api en v 

<400> 118 \ ' ' ' " " " . 7 '•• •' - ' ' 

cctgctagaa tcactgccgc tgtgctttcg tggaaatgac agttfccttgt tttttttgtt 60 
tctgtttttg ttttacatta' gtcattggac cacag/ccatt caggaactac cccctgcccc : 1-20 

acaaagaaat gaacagttgt agggagaccc a'gbagcacct ttcttccaca caccttcatt 180 
ttgaagttcg ggtttttgtg/ ttaag^taat ctgtacattc tgtttgccat tgttacttgt - 240 

actatacatc tgtatatagt gtacggcaaa' agagtattaa tccactatct ctagtgcttg ''$&0 
acttt * : ' ' ' : ; '305 

<210> 119 

<211> 367 : ; : 

<212> DNA / 
<213> Homo sapien * J 

<400> 119 ; : ' ' '. 

cggtacaaga catcaaagtg aagtaaagcc caagtgttct ttagcttttt : ataatactgt J 60 

ctaaatagtg accatctcat gggcattgtt ttcttctctg ctttgtctgt gttttgagtc * 120 

tgctttcttt tgtctttaaa acctgatttt taagttcttc tgaactgtag aaatagctat 180 

ctgatcactt cagcgtaaag cagtgtgttt attaaccatc cattaagcta aaactagagc 240 

agtttgattt aaaagtgtca ctcttcctcc ttttctactt tcagtagata tgagatagag 3 00 

cataattatc tgttttatct tagttttata pataatttacr catcagatag aactttatgg 3 60 

ttctagt " ' " ! ' ' ; ' " " r;; ■ ' - • "3 67 



<210> 120 , 

<2ii> 4 or 

<212> dna; 

<213> Homo sapi;en 




<400> 120 



acaggtaaat aaaagatcac cttgaattaa actggatctc cttaagggca tagtatagtt 
tcagtttcat tacctattac ataattagtt tcttacatac aaatattgac atatttggct 
tgtgcttcga agcctttgtg tctatgaagt ccacatcaat gcagctcata actggaagtc 
actggggagt tctttgctgc tgctgggttt aacctgatca tgcattagag tctcctcagc 
acctgttgtg gctctgcaca cctctggggc atcgtcagtg tcaggatcca agccttcagg 
gcagggaagt ttcagcaact cttcgcggag ctgagcagtg tgacgcttga gagctgctgc 
atggtgagac atagtcctgc ctacccgctt atcactgctg t 



60 
120 
180 
240 
300 

3 60 

4 01 



<210> 121 
<211> 176 
<212> DNA 



< 2 1 3 > H omp , sapien 




<400> 121 



acagcccaga tgtgatattt ctacaggaag ttattccccc atattatagc tacctaaaga 
agagatcaag taattatgag attattacag gtcatgaaga aggatatttc acagctataa 
tgttgaagaa atcaagagtg aaattaaaaa gccaagagat tattcctttt ccaagt 



60 
120 

- 176 



<210> 122 
<211> 443 
<212> DNA 



— • ' $ ■ j. \ 

< 2 1 3 > Homo s apien 



<220> 
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<221> misc__f eature 
<222> (1) . . . (443) 
<223> n — A / T,C or 



<400> 122 
actgctgcca 
gtcctgcagg 
gctcaactgg 
tgggctttac 
^agactcaaag 
i c.acctcaaat 
g.tatctcctc 
. cacaaaacca 



<210> 
<211> 
<212> 
<213> 



123 
520 
DNA 
Homo 



gttccccacg 
agggaggagt 
ttacagacct 
cacctgccac 
aagaaaaggt 
cctftctgaa 
acaatattga 
tttcaagctg 



tggcccagcc 
cggtttccaa 
cgagttttca 
ctgggccaga 
ctfcgtctgat 
attatgtaag 
attccatgaa, 
aan 



ccacccacag 

tgccagccgc 

gcccatgtta 

gttctcttat . 

gta.taatctt 

atgaaaact.t 

ac cacti ate 



gctctcctgg 
cctaacaacc 
cttgaaggag 
ccttatcc.ta. 
aaa,ataaac,c 
aaatg.bctta 
tttgcatgpa" 



geccaggaat 
caggaactca 
aagcagttct 
aga^tc.ttt^ 

cacagttagc 
Jtagataccaa 
atgaagcatc 



60 
120 
180 
240 
300 
360 
420 
443 



f 



sapien 



<220> 

-<221> misc^-f eat are ^ 

;<222> (d;. . ^fai' 

<223> n ^ A r T , C 



or G 



i Z "i .-5- f -; :• • 



<400> 123' 
actgtatatt 
^agtatataag 
atatcaaaac 
taaaaatttt 
acagtccang 
cctaaggggt 
tccactgggt 
agctgatgag 
ctgattctca 

<210> 124, 

<211> 406 r . 

<212> DNA 

<213> Homo sapien 



- -t 



.ngaagattge. 
gtattagata 
atcatgttat 
aaaaaattat 
aaatatttaa 
ttctgtaggg 
gatgaganta 
ccaaagagaa 
aagtcacctc 



"taa,gataatg f 

tgttaattag 

ataccatgaa 

caattegtta 

agctagtcaa 

atcaccccca 

aggtgatgga 

acttcagcct 

agtaacttgc 



.ga.tttt.aagt, 
cttgatttag 
tatagacagt 
attttaccaa 
cagcttctac 
tctctagact 
ctgtcgatca 
gtgaaataaa 
ccaaggatcc 



gatctc.ac.ca. 
ttattctaca 
ttctgtcagt 
gttggggcaa 
agagatgaag 
tctacctggt 
actaggncca 
aacacttcag 



ca;aaaaaaga 

aggtatccat 

taaaagtaaa 

aagectttta 

aacattntgt 

aaacacgcct . 

aggcctgggt 

at'tagaangc 



<400> 124 

actaaaaatc 

tcttattttt 

gttttcagct 

aaatggaaat 

aattcaggtt 

aatgctgtgg 

tgttaagcac 



aattggatga 
actgtttata 
aaacttggtt 
actgtttttc 
tgatcaaggt 
aaatgaactg 
taatcagtat 



actaaatcca 
tatttgaatg 
tcctagaata 
cttgtgaatg 
gtgaataact 
tgtatatact 
aagtgcaact 



aaacatgaca 
ctgctacaac 
gactgttaac 
aattttcata 
gaagaaaata 
tctgggagga 
cctggttctg 



ctgtag'gcag 
agatgatctt 
tttcaaaatt 
tttgtaagtg 
acttgetgge 
acaaatttaa 
tacctg 



cagttttaag 
catccctgaa 
tttattggtg 
ctaagtttat 
tatataggaa, 
tcatttcttc 



><210> 
<211> 
<212> 
<213> 



125 . 
413 
DNA 
Homo 



- J.: .J- 



- 1 J 



60 
120 
180 
240 
300 
360 
420 
480 
520 

u V > 



60 
120 
180 
2,40 
3013 
360 
406 



sapien 



<400> 125 

gttttctttg aatgatttct ttttttcact gtaagacact cctttaaata atgectatet; 
ttaacttttt aagactattt ggaaaaatgc agtgtctcag ctgtccccag ggaaattaag 
tggaattcaa ctaagatctg ttaataagat gtcagaataa ctaataattt tattaggaaa 



60 
120 
180 
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aaatcatgtt ttaaatttca aaatgacact tatttgtcaa gtaatatgat cttggaaaat 

tttaaagaaa aataatccta cttataaact acttttttat aattgttttc agaaaaaaag 

tttacagtct taaggaaaat attcaggtct atcatatggt ttgacagatt ttttaaaagt 

tatttttggt aaggtcttct tttagaaaaa aattaatctc aagggttttt tgt 



240 
300 
360 
413 



<210> 
<211> 
<212> 
<213> 



126 
655 
DNA ' 
Homo 



sapien 



<400> 126 
gtattctata 
•gaaattgtta 
cctggggtgc 
tccagtcggg 
gcggtttgcg 
ttcggctgcg 
caggggataa 
aaaaggccgc 
atcgacgctc 
cccctggaag. 
c6gcctttct 



gtgtcaccta 

tccgctcaca 

ctaatgagtg 

aaacctgtcg 

tattgggcgc. 

gcgagcggta 

cgpaggaaagr 

gttgc€ggcg 

aagli.cagag.g 

etc cctpgt.tr! 

cccttcggga 



aatagcttgg 
attccacaca 
agctaactca 
tgccagctgc 
tcttccgctt 
tcagct pact 
aacatg.tgag; 
tttttccata 

tggegaaace 
,cgc\tc;tcctg 
agc^i^ogc 



cgtaatcatg 
acatacgagc 
cattaattgc 
attaatgaat 
cctcgctcac 
caaaggeggt 
caaaaggeca 
ggctccgccc 
cgacaggact 
.t^tqdgaccct 
tttctcatag 



gtcatagctg 
eggaagcata 
gttgegctea 
cggccaacgc 
tgactcgctg, 
aatacggt.ta 
geaaaaggec. 
ccctgacgag 
.ataaagatap'j 
gccgcttacp. 
ctt'cacgct-t. 



tttcctgtgt 
aagtgtaaag 
ctgcccgctt 
geggggagag 
eg ct egg teg 
tccacagaat 
agfgaaccgta \ 
catcacaaaa : 
caggegttte r 
gga tape tgt 
cr_taag ; . - - . ; - 



6t) 
120 
180 
240 
3Q0 
360 
420 
480 
54.0 
600 
655 



<210> 
<211> 
<212> 
<213> 



127 
442 
DNA 
Homo 



sapien 



<220> 

<221> misc_f eature 
<222> (1) . . , (442) , 
<223> n - AfT/C. or 



<4Q0> 127; 

accttatggt 
aaatggaaga 
cgtttagatc 
accacagtga 
catttgatca 
caaaatgeaa 
ttgcctcctg 
aggntaaaac 



cptitgaaagg 
gaacaagttg 
agacacagag 
acagagggat 
ggatttgatg 
tgatctcttc 
ttcactccat 
gctaaacctc 



J ..r 



' - *- - r - ■ v 
i* i / ^ v . , l * ■ - , 

aagactcaat 
acaataattt 
tgactgagee 
tctttgtcca 
cttcaaagat 
catttgtgga 
tttcaaggct 
aa 



■c J 



o 'J :~j £ V- O 



- — ■ I ,' 



fc'i " 



acttccagga 
gaagcaattc 
aatcaacagg 
aggcaggctt 
gacccactct 
acataccacc 
agagaaagtt 



gtcaaagtta f 

atgettctag ; 

catgtagtgt 

geagcteggt 

ctgtaaactc 

aacacaaacc 

caagtccaaa 



atttgaatga 
xjgctgaatga 
gatctttccc 
ccagcttgag 
attaccaaag 
acgcgtggct 
acaacagtta 



60 
120 
180 
240 
300 
360 
420 
442 



<210> 
<211> 
<212> 
<213> 



128 
447 
DNA 
Homo 



sapien 



<400> 128 . 

gtaaaatctg 

acatgataga 

aagagaagee 

gagagatagc. 

gtattgagat 

tqctgtttga 

aatpatttta 

tacctgccga 



atggtggtta 
gcatgtatct 
atttggtaga 
atttaaaact 
aaaattttac 
aaatatattt 
a'gtccaagac 
ctgetgacat 



aatgacgatg 
gtatttttaa, 
aataaagaag 
tgaaatcagg 
cacttctctt 
tactggtatt 
;'tcttatttt,a 
gtagtgt 



tttaggtttt. 
aaataaagac 
gagittgggg 
cacaacaatt 
ccctttaata 
gtgetttect 
catattctgc 



gataaattta 
agagaactta, 
aaggagatga 
agtatgtcat 
aattgtcaaa 
catatcacag 
aattaaaggt 



gattttatac 60 

rtgtttagaae. 3-20 

gaatgagtca 180 

gatataaaca , . 240 

ggataaagtt 300 

attggtaaag- 360 

cctatgaggc ,420 

447 
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<210> 129 

<211> 175 ; ; ' r : - : 

<212> DNA" ' ' : : * — ' ' ' - 

<213> Homo sapien l '" ' ' ' ■ 

<400> 129 

ttcagacttt gtttgtagtc agccttggtt tggcttcaga ctttgtttgt cgtatttgag 60 

gatataaata ttcatgaata gtttcccaag tctggagcga ccacataggg agaaaatgta . . 120 

aatgtctcaa tttttgttca caaaagtata ttttatcaaa ttgctgtaag ctgtg 175 

<210> 130 

<211> 406 ; ° 

<!212> DNA ' '■' ....;[. '>' : ' — ; ■ ^ :> ■: ' 

<213> Homo saf&en " r ' * - ,j- !; : - s: . .. • <■:: ■ . . .! :' ? 

<400> 130 : *■ .-;;=: , ^ i fc! ? n ■;:?•■: -.v.. : < - ■ \ o. , : : // . ■ ... 

acatttacat .tcaWg^ atgctagaga ■ 60 

actgaca€tt - ? cagacatggt T -catatatatg dtatttgaiit \tcctttatct 'tfajataca^at^ 120 
cttgattg^ : aa^^ tgtfeiccgaaa ct:cat£aaga M iM * : lpO 

tfaattgtatt ^c^t ttaggtctcg capgctgtct ^ 240 

atggcttgct; 'G^^gtgatat r Wgt^tcagac ^a^gta^ag'ta g&ag^caa% r c r ^^ctac'aa'ga" '"' : 30'0 
; \caagatctcV caa^ tttaaggcaV '' 360 

gttgtaagct cttccattat ttcaca'g^g' 'ctggctat^t ; ca^a% ' " : 406 

<210> 131 • 1 - 

<211> 403 , : -- ' - r ^ ■ 

<212> DNA 
<213> Homo sapien 

<400> 131 

. accgcattac attatgcctg tgaaatgaaa aaccagtctc ttatccctct gctcttggaa 60 

gcccgtgcag accccacaat aaagaataag catggtgaga gctcactgga tattgcacgg 120 . 

agattaaaat tttcccagat tgaattaatg ctaaggaaag cattgtaatc ctfc^tgacca ISO ' 

caccgatgga gatacagaaa aagttaacga ctggattcta tcttcatttt agacttttgg .240 

tctgtgggcc atttaacctg gatgccacca ttttatgggg ataatgatgc ttaccatggt' ^ 300 
taatgttttg gaagagcttt ttatttatap bat^tgttta^; tca^tcaa§t' ; tcaccStggc^ : 360 

cgtaatcctt ctaagggaaa cact'aaagtV gttgtagtcb : fcca v ; V : ■ . v : [ - 403 

<210> 132 ^' ■ ' - " L -' - V i- ' - 

<211> 479*^ : • ; - 7 ' '* " ! . '-:- r ? ' 

<2I2> DNA * : , - 

<213> Homo- sapien- : * ;r ' r 

<220> 

<221> misc_feature 

<222> (1) . . . (479) : 

<223> n = A/T/C or G 

<400> 132 

cgaggtacag ggggaccccc ttctcaaegg caccagcttt geagaeggea agggacaccc 6.0 
ccagaatggc gttcgcacca aacttagatt tattttct^t tccattcatc tcgatcatca " "120 
gtttgtcaat cttctctt^t tctgtgacgt^ tca^tttctt '|dtaaccago/ geaggegcaa : : '* ? - : I8b 
tagttttatt ga'tgtgctca acagectttg , agaeaciECtt , "ccdcatfata^f cgagtcttat * : -•" 24'0' 
cattgtcccg gagctctagg gcctcataga : tacfca4ttga*''agciccactg^^g^ - Yoo 

ctetgaagan accttttgag gtgaagagat caacct'caad :: agtfgggattc ccgcga^acft : ,: "' 360- 
daaagatc€c ect^ggedtgg atcttgagaa 1 tagacatggt gaadt'tctag' ccactgggtc - *' ■ *- 4"2 : 0 
tcgtcgccta 1 ^ ggagaggaag cggagggtgc J t:gcanacacc gaggtgaacg ^taaagcccg ■ 479 
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<210> 
<211> 
<212> 
<213> 



133 
301 
DNA 
Homo 



sapien 



<400> 133 

gtcttacagt 

atcaaaggac 

ctgggaggcg 

gtggccctgg 

ggaccagatt 

t 



gtgactcaga 
acgccaagtg 
gtggttacac 
atacggagat 
tcaagctcca 



ctccctatct 
tgtggaattt 
cattcgtaac 
ccctaatgag 
catcagtcct 



ggggatcggt 
gtcaagagct 
gttgcccggt 
cttccataca 
tccaatatga 



taggttgctt 
ttaacctgcc 
gctggacata 
atgactactt 
ctaaccagaa 



caatctaact 
tatgctgatg 
tgagacagct 
tgaatacttt 
cacgaatgag 



<210> 
<211> 
<2i2> 
<213> 



134 

494 ; 

DNA 
Homo 



i . ■ ■ t 



O F., 



<400> 134 

actaagtgta 

tattttagga 

taggggggcg 

tagcagagaa 

aaagttttat 

taattggctt 

atttaaaata 

tctgcaacct 

attgagatga 



sapien 



tacgtatttt 
aactgtaaaa 
gtgactgcaa 
acttggaaag 
acacagatat 
atttttctat 
taaatttatt 
ctaaaatgat 
aatg 



r .:.< 



Tie" 



tgccactttt 
gtaataggga 
ctgtgttagc 
ttctgtgtta 
ttgtattaaa 
ttccgtaact 
tatttatttt 
tgtgttgctt 



tcctcagatg 

aagagatttc 

agaaattcac 

ggatcttgct 

tttggagcca 

attgtaattt 

-tttaatagtc* 

ttaggattga 



attaaagtaa 
actatttgct 
agagaatggg 
ggcagaatta 
tagtcagaag 
ccacttttgt 
aaaaatcttt 
tcagaagaaa 



gtcaacagct 
tcatcagtgg 
gatttaaggt 
actttttgca 
actcagatca 
aataattttg 
gctgttgtag 
cactccaaaa 



<210> 
<211> 
<212> 
<213> 



135 
448 
DNA 
Homo 



t 



60 
120 

rao 

240 
300 
301 



60 
120 
180 
240 
300 
360 
420 
480 
494 

.••■"> 



sapien 



<400> 135 

actgaactcc 

actttgcatt 

ggaaagctct 

ctgatttgct 

ccactgtatc 

ctccaactat; 

ctctggaaag 

ctgggtcatt 



catcacaaca 
gggctctgcc 
tccactctgc 
ttcct^atca 
tccaagacca 
tctgtcaggc 
cacagctgcc 
ctgagaagat 



tcatcttcct 
atagctgctt 
acttgattgt 
tgaatcatcft 
aggtctttga 
tcttttaagt 
a'aagcagagc 
gttgatgt 



ctaataactg 
tccggagact 
tttctgacag 
aacoatttac 
gaacatgata 
ctctctcaat 
caagcacgcc 



taacacaaca 
catgatgaat 
agggtaagga 
aacctigggca 
accacccggc 
gactgtcacc 
agctcccacg 



<210> 136 
<211> 527 
<212> DNA 
<213> Homo sapien 



ccttcaataa 
cttccgtgat 
atctgaacct 
tcaagacctt 
tgcaggaatt 
tttc€tccat 
atgataactt 



60 
120 
180 
240 
300 
360 
420 
448 



<400> 136 

accatggtgt 

atgtccaaac 

actggtccaa 

tgtggggact 

cctccttcac 

attaaagttt 

gttgatgtcc 

cttccttcct 



cagcaatttc 

caacaaactc 

tagagaaaac 

gagccaggta 

tggc^gctga 

ctcg&aagag. 

taagatcaat 

cttctccttc 



ttccataact 
agttagatgt 
cttctcaaaa 
tgcattattt 
aataatttta 
atgg'bagatg 
igactcfcgttg 
tgcctcaggc 



tcgtggtaat 

ctatgggtat 

tcagcacaaa 

ttaaaatatg 

ggagtttgga, 

ccagactgga' 

tctaatcttg 

cgaacagcat 



ggtaattaaa 
tagagtcttc 
tgcacatttg 
acacagtaaa 
tttccacaaa 
gacgga^agac 
tatcctggtt 
catccagctg 



agccatttca 60 

cgctctgaat 120 

cttatatagc 180 

aacattggct 1 24 0 

acctt'tgtta : 300 

tgcctgacta* ' 360 

aacagtagct 420 

caggggcaga 480 
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cggggttcag ccaaactgat cacataaatc ttctgaacat gtaactc 



527 



<210> 
<211> 
<212> 
<213> 



137 
275 
DNA 
Homo 



sapien 



"<;220> - - > • fcr :- . 

<221> nd.sc_f eature ; , ■. -v-." t - ; '< : 

<222> (1)...<275), ' 7 .. , .. -5 ■ 

<223> n = jA,T,C./pr G ?I . : . ] T - ; • j • -. 

<400> 137 

acgacgagtc gggcccctcc atcgtccacc gcanntgctt ctaaacggac tcagcagatg 

cgtagcattt gttgcatggg ttaattgaga atagaaattt gcccctggca aatgcacaca, 

cctcatgcta gcctcacgaa actggaataa gccttcgaaa agaaattgtc cttgaagctt,, 

gtatctgata tcagcactgg attgtagaac ttgttgctga ttttgacctt gtattgaagt" J 

taactgttcc ccttggtatt tgtttaatac cctgt 



60 
120 

240 
275 



<210> 138 . .,; Kf • 

<211> 354. -, 
* K2 13> Homo sapien :} t . : , . r . 0 



.tj:r.'= r 



- i ■■ 



"i * 



"T * 



J ! J J - 



<400> 138; :t.j M • '..->■-, -I" - . • • • ■ - ' j ■ , ; * ; -.' . r , J ; 

. caagct caag, g r tgtttctgt. .cag : gaatgcc ; agagctgcg'g; c r tgggcctca at^agcgcgt 
fgctcttcgag ctcactggcc gcagcaagaa ; c^aatcagU gagctgggggj , atgtaaaatt 
ccaccagtgc gtgcggctct ctcgctttga caacgaccgc accatctcct tcatcccgcc 
tgatggtgac tttgagctca tgtcataccg cctcagcacc caggtcaagc cactgafectg 
gattgagtct gtcattgaga agttctccca cagccgcgtg gagatcatgg tcaaggccaa 
ggggcagttt aagaaacagt cagtggccaa cggtgtggag atatctgtgc ctgt 



<210> 
<211> 
<212> 
<213> 



139 
527 
DNA 
Homo 



sapien 



60 
120 
180 
240 
300 
354 



<400> 139 ^ ' vr . ( , ■■: : ■ 

acgaggaatg acctctaggg cctgqgcaac agccctgta^! ggccattgltt cpacaccagt 
catggccttg gatttttctg tcaaggcatg ggccacagcc atct;cggagg cfcccacpccc 
tggcaccagc tgagggtcca ggagaacatt gcgacacact to;ca';t;gpcat cctgg^ggtt 
gcgttctact tccgagagaa £dtc£.ttgct agccccccgcj aggag.aatgg tgcaggcctt 
ggggtctttg cagtcagtga tgaaagtaaa gtattcatct cc^att:.ttct tga'tttccaa 
caggcctgct cctgttccaa catcatcttc tctcagttcc tctggtcggc tgactatccg 
ggccccacag gctctagcaa tgcgattatt gtctgtcttc cggactctgc ggatggctgt 
gatattggcc cgcataaggt agtgctgagc taaatctgag atgccctttt cagtgatgac 
cacatcgggc ttcagttgga taatgtcctc acagagctgc tggatgt 



60 
120 
180 
240 
300 
360 
420 
480 
527 



<210> 
<211> 
<212> 
<213> 



140 
396 
DNA 
Homo 



sapien 



<400> 140 



acgccactgt' .c.tct.tagata taatt'atccc cacTccictgc .tcatttgttt cccagattca" 
. atacattgt'c; aaagcctctt ggjtccttttt? taacatctca' cac^tgtgtc; attctqtcca/ 

ttcccataaa .cctcaacaac .tgctcaaagt cctgQttgac/cccttgtt^c cagtctttga 
. aatct tt ct t: jgpata tgact ' gcc.t cattap. ,ctt cctaaaa t c tagtt ca'c tcgcctact c 



60 
120 
180 
240 



BNSDOCID: <WO 01 96390A2_I_> 



WO 01/96390 



39 



°¥^Ms6i/18577 



aagaagacac aggggcctac tgtggtgtat tagataagtt cacatttctt ctctttacta 
atctttttta cttcctttac caccactccc ttatataatt ccatcatcct aatagafectg 
tttccctaca catccctgcc tctccacccc acatgt 



<210> 
<211> 
<212> 
<213> 



141 
490 
DNA 
Homo 



sapien 



3 GO 
360 
396 



<220> 

<221> misc_f eature 
<222> (1) ' ( ; 490). 
<223> n = A,T,C or 



<400> 141 

acaaccagct 

tcaagatcgt 

agcaactgcg 

cggtgtccat 

gggaagagca 

ctctgccaga 

tgcccacatt 

tcgcttacac 

ctgttcctaa 



gtgctataag 
agagggaatc 
aagctcacgg 
ccaggccagc 
gtctgagtgt 
cacgggctgg 
ctcttnctgc 
cgcaggcccc 



aaagagggag 
tgcctcctgt 
gggggcctcc 
tctggctccc 
agcagcgtcc 
aaacatgaac 
ctgctgttcc 
gcttctgtgt 



ggcctgacca 
ctggggatga 
tccgggatca 
tggatgacac 
atactgcagc 
gcaagctctc 
ttctccttta 
gcttgtcctc 



taactacacc 
tactgagtgg 
tgtatgcatg 
agagacggag 
cactccagaa 
ctcanagagc 
tggacgtcta 
ctcctcctcc 



aaggagaaga 

gatgacctca 

aagacagaca 

cagctgttac 

agacgaggct 

caggtctaaa 

g;tccttgtgc~ 

cacccc'ataa 



; eo 

120 

:i8o 

240 
300 
' 3^6 0 
420 
486 
490 



<210> 142 , 

<211> 511 

<212> DNA 

<213> Homo! 

<400> 142 

acatccagtc 

ctatcataag 

gttaggtgct 

tcataaccaa 

tattacaatc 

atcaaaataa 

gcactggtaa 

tgttcacctt 

ctctgtgtcc 



sapxen 



tgtatttctt 
ccatcactag 
atgaccacaa 
aagaccatat 
tgagcctctg 
cctatcatac 
ctttgtccac 
taacgtgggg 
tctgtggaac 



„ I. 



acacaaaatt 
gaacttctag 
tgaatacaac 
ttcaaattaa 
aaacaggggg 
tacaaagttt 
tgttagagat 
aaagttggca 
ctggcttttt 



<210> 
<211> 
<212> 
<213> 



143 
463 
DNA 
Homo 



sapien. 



a cat: c t a aaV 

tctgtctcac 

agacagcctc 

atcatagtag 

aacatataat 

tcacttccaa 

taaaacttcc 

aaaagaaccc 

t 



^atttgacatg 
tcgattgagg 
tcagctgtgc 
cgaatgacat 
ggtatccaga 
aaagtgtaac 
aaagcaaatg 
caggaggaca 



aggfc'catttg 
ctacaatgtt 
tgcaaagtat 
accatttaca 
acatctttac 
agagtttaag 
aaagaaccaa 
cccaaacctt 



'60 
120 
180 
240 
300 
360 
420 
480 
511 



<220> 
<221> 
<222> 
<223> 



mis cofeature ,. r 
(1) . (4 63)," \ \ 
n *= A, T~, C or G 



f "; 



. I. 1 . ' 



<400> 143 

actgcagtga ctcatcagag tagaaggagt attcaataag tgggacttct gtgtcgttaa . . 60 

attgggcata tgctaaaaaa gtgccgtttg gagaccacca cagagcagag taggcactga 1 120 

agacttcctc ttcataaacc cagtcagtta ttccattata tattatatct tctttccccg ' 18 0 

tccatgtgat tctgtaactt ggtaaatttg gttcaatttt aacataaatg tcattgttcc 2.4 0 

aaacatatgc caatttatga .cccacjtcrgrtg accatgtgac. .cca.ctgtgtg ttgtttggaa 300 

tcctctcttc tgtaatcagc tgccttttat ttaaatcata aatgtcatat gaagctgtgt '360 



t 
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aggaatgcct. ccattgcttc acgtagttgt attctaagag aataaactgc ccatcangag 420 

atattgaata. "afccatfcgata gaatgnccaa actcatcaaa tat'' ^63 

<210> 144 ' ' 

<211> 297 

<212> DNA ' ' . 
<213> Homo sapien 

<400> 144 7 - J * •■-■*■'■■ 

actcattaat attattttgt tttgagaaag ccagaaatga ttctaagaaa taaacaataa 60 

taataaaaga tgtaattaat atactgtatc ccttttaagc caaagcacac tttttacctc . . ,i- 2 ,9 

aagactgttc tgacttttac attcttaatt tcctttgtcc aaaataggac -cccatttfe'aa . ; * i66 

atagagttca tttgaattga gttcataatc taaagtcact tttccccaca agatgttttc 240 

atttcagtat ataaactgct aagcggcaaa tgactaagtc agttataaag aatttgt 297 



<210> 145 
<211> 356; 
<2l2> DNA, ... 
<2 1 3 > Homo _ *s api%n 



4 



<22o> 7 ^' v '.' : - - 3 * - : ' -*'-'-? '• ■ o ; •• - 'y 



■***<221> misc^fe^ture 
<222> (1 {35 6/ 



<223> n = A,T,C or G 



<400> 147 ■ ' ' - 5 ' ' : '"V ; " }? : * ~ ■ ' : " ' 

ait'tttttact t tatatatga aaatgt catg a^aat'ttataa ' ^caataatgt* attgatactc 



\ - 
-i --t 



356 



1 : 
-.i 



<400> 145 

actnctgcac ctccttcagn aggaggncaa aggggaatgg cgacagctgc tcaatccttg €0 

tgatggncac ctgccccacc atgtcgcgtg ctttgcgctc ccgggttgag gtcataatac 120 

actttgccgg tgcagaanag aagccttttg acattttctg ggntctgagc tgcaaggcca 180 

tcttctggga tcacccgctg gaanngggtn cctggaagca tctcatcaaa gfetggatctg 24 0 

gcctcggggn ggcncaacan ggatttgggg gtgaagataa ttaacngctt ccggaatggc 300 

agcnggatc^ gjgcgtogtaa cacgtggaag aagctgcca.c gagnggagca nttgac 

<210> 146 P :r< ;;^ ' -';:v, o-n.ir-./ '.i-jj .,^^,^ ti no 

<2ii> 355 ' J ; : ' 
< i 2,i2> dna ; ; ;: ; , J; / ; " 

<213> Homo sapieh - 

<22o> . ; : , / ' >: _ ■ y 

<221> misc_feature ' . - . ^ - ^ . 

<222> (1) . . . (355) ; . . ..." ; L . r 

<223> n = A,T f C or G 

<400> 146 / 1 

acagttttgt tttctcgtaa ggggagcatc atagggttac tttataccag ttgtaacatt 60 
ttcattgttt ttggttgttc ttttttcttt ttttaatggc agctaaagat atacagatta 120 
ctgttaaatt gcagtccttt tttttttaaa natattttct tgagttattt aaaacatggt ^180 
aagcctggta ttttttaatc aaacaaaata tttatgaaan gggttttctc ttaattctgg, 240 
attcatcatg gctttctaat accaattgta atatttacaa tattcaccaa 'aacttagaat ' "" '300 
tttgcaaatg ctggaattct gccagtgttt ctttgctaag ccttgcatgc aaaat 355 

<210> 147 

,<211> 209 , . * 

<212> DNA " ' ' :! ' ' * • "t*-y - r -' .= : i - ' ; ■ 

<:213> Homo sabien : 1 ' ---^ ■-;:'■:*■:'..■.)■ - . - ■ . : \ - . -vv. ■ * 



60 



BNSDOCiD: <WO 01 96390 A2J_> 



WO 01/96390 



41 



( T^THjS6ih8577 



aaatttttaa aaatttttaa attttaaaat at'ttaatcaa cttctattat ttttcctctt 120 
ctgggatgaa ttaagtggca aacttggcca ttctaatatt tactcactga tagccaaatt 180 
ttatagcgtc tctatctaaa gaagacagt ' ' 209 



<210> 148 
<211> 445 
<212> DNA 
<213> Homo sapien 

<400> 148 

actcccagca aatcctctga atactccaca .gactatgtta cccagtccca aggctfattaa 60 
ctcctgattg ccatcaagtg gataatcgta tttgagggaa tagacgctgg caactgaaaa 120 
ggccactgca aatgcaacca ttgcgatgcc gaagcaatct cctacggtgt tttg'gaaagt 180 
ctccacgtca ggtgtaatag ggggctgaaa tccaggattc atgtccccaa ccacagccac 240 
tttaaacctg, tttttaaagt cacagccgta ggatacacct. gctgcaatca, cggtcataat 300 
gaattcgatt ggaatgggcV ctggaagttt . gtctttgaag Ggctgattta' .'. . 360 

aatggataca f acqaaaagga caat'cagagc. ."t^pVc'cagg" tctgcaatai tagtctitctc V'l' [ 4£p 
tatttgtgag \. a^tacagagt. atagt ,''„,- ~ \ . . J. ; . .// . V .4"45 

<210> 14^ , .... .. . , ., ... . , /• , 

<2ii> 585 : r , ;.. ;.: . : ; . _ . .j.. 

<212> DNA 1 . ........ 

<213> Homo sapien 



<220> 

<221> mis cofeature 
<222> (1) . . . (585) 
<223> n = A, T, C or G 

<400> 149 « ■."!"" 

actattaatg . agaacgaaat acacattagg aaaaltggagc ,catttcaatc^.tagtggtt.t.g ., \ + ,.. V 6.Q 

ggcaagatgg "ggaagagaag .gggaa^.catt [ .c.t^gij: t cjt g? t ^ t^kc^i 1 1 a . ' tta|:gcccct ; 1 120 

cctgaaaagg ( tgg^ gaatgtaact 180 

gaggattatCjVitca^gcaa^ ^ j^agbaagata;! tcbtc^ctcat ggt'cccttta\ gctctcaaaa 240 

gcaatgaaat ccfecctgttc tcatttttac tgctgtggtt gtgctgctga^ acaatactat , 300 

cttctcaaat tccatgccac aaattcagca ataacttttt ggattgaatt tagcaactac 360 

tgtaattgga tgctgatgtg gacaaaatat attgatttcg atttcactcc cgaatgtgat .-. 420 

tgccaccagc tctttatatt gctgctgtgg tattttaaac cagaagcttc tttaaattat 480 

gttgcaaact gatctttgnt tttatgtttt ggtttggttt tatttctaag tgataagttt " 540 

gaaacacaca gctttaaatg atttttttat tgtgggattt tgggt 585 

<210> 150 
<211> 508 

<212> DNA ..' . "" ' " 

<213> Homo sapien . ( 

<22o> .. ... , t /. , \ ; .; f .. ' 

<221> mis cofeature V ./ 

<222> (i) r . . (5Q8) : , ; ~ ; _ .' 

<223> n = A,T,C or G " " .... 

<400> 150 

acaatgtctt agaaagtctt taagtcacat accatgaatt tttgcttcat tactgaccat. .60 

atatgacctt ggaggaactc tttttttttt ccttctactc atttctgttt ccacctacca . -120 

tgactcaccg tatttccagt cttctacccc tgcagttatc ctagtccagc aaagtcattt . 180 

ntttcaaaan anacatcatg tctgaaaata attactggta gtctaatatg agccanagta 240 

aacagctcct catggtcaat gaacatgttc aggaagcgat caccttgatg cttgaaccca 300 

accccanaca gnggacaatt ntactttgaa atatccgnga atatttactg ggggatccaa 360 



PCTYUS01/18577 



42 



tttaaactt ; c tttnttctnt agcctttaaa ttacacaact ttgaactgac acggatctnt 
tacaaanaac aatgcggcac tgaaggaana gatgattcct* ttactcaaac ctgcaggaat 
cagcctatta acagg'caggg gaaacggt 



420 
4 80 
508 



<210> 
<211> 
<212> 
<213> 



151 
434 
DNA 
Homo 



sapien 



<220> 

<221> misc_feature 
<222> (1) . . V<434)' 
<223> n = A,T f d c or G 

<400> 151 ; ; ; ; 
accatgaata a'aagtgcatt. 
;a;agaagacca crtfcccgaakc 7 
*tgaaaagaat ; gaactttttg' 
agacaaagct agcagcaaaa 
ctcctatagt ctaccctata 
tactaaaatc ctaacactaa 
atctaactcc cgggtaagca 
aagggcgaat tctg 



t b a a taccad 
t a g t g c a a jj a : 

aa$gkataacf 
ttttaatcct 
cgaacagctc 
ttccatatat 
aatcttgnag 



1 1 1 taac a a c 
ga ttgagcat 
cattaagact 
tgatgatgta 
ttcctatttt 
tctgtgtgcc 
taagaggcag 



agcatatagg 
taggcacaaa 
agatgaccac 
gctttcaaaa 
cctctttccg 
aggcatttcc 
tacctgcctg 



gcagacataa 
gg g ;ag;aaaaa 
at tattatag 
ttt'gcattct 
actgtgaagt 
catgcttgct 
gcggccggtc 



<"210> 
<211> 
<212> 
<213> 



152 
320 
DNA 
Homo 



60 
120 
180 
240 
3QQ 
360 
4^0 
434 



sapien 



<400> 152 

actttgcaat 

tgatttttaa 

ttctggctaa 

ccttatagaa 

aatcactctc 

ttgaatgaag 



catctttcct tttttcacat 
1 t tgttgcct''; cigala 1 1 1 ' 
a tctttrgtat 1 ^t'gVt't^t'ta 11 
tg'tggaaatt a;tt^ct : ccat F ' 
cttacctafcrt tatctacaaa"' 
acatctcagt' 1 



tggtaaaaat aagtggcatc cataggatca 
cactccatca agatctgcca atcttcaata 
;Sac&^t^ gt<^tgtcttt Y 

ftccytcftgai 'tttgacitgd gSgctaagatT P 
tgttcattci ^§aaa€gttt" a 



60, 
120 J 
180 
l 240 
'300' 
320 



<210> 153' 
<211> 459 ~* 
<212> DNA 
<213> Homo sapien 



r ~ j' 



<400> 153 

acctcatttt 

ctgttcaatg 

taaaagcagc 

aactggcaaa 

ttgctataaa 

aactctctgt 

tgcctcctga 

atgatgtaaa 



tattagccat 
acagaaactt 
atttttgatg 
aacaacaaaa 
tctacaaata 
ttaacagctt 
atgtccagtg 
gtggatcaaa 



tatcttcatg 
atagagagaa 
tggtaggaag 
gaacaaattg 
ttttaaatat 
ttgaagagtt 
ataaactgaa 
gtatgtaaca 



<210> 154 
<211> 503 

<212> DNA ' ' * 

1 c213> Homo sapien ' 



ctggattcta 
aattccttct 
atatttatga 
ttatttaacc 
atttcctcct 
aattttataa 
ccctaatttc 
agttaatat 



i..- * ■ - 



atattctttt 
caatttataa 
caaaagcagc 
tttaaataac 
actgcaataa 
ggaaataaaa 
cctacctcaa 



taatggtgat 
acaaaaattt 
tactgbccta 
gagtctctat 
aaattaagat 
aagattgact 
caa'cataaaa 



60 

.120 

180 
240 
300 
;360 
.420 

'459 



<400> 154 
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acacagcctt 
tttctggaaa 
ctccttccat 
tcacggtgac 
tcctagcttg 
tcacaaaatt 
gttcaagttt 
ccctcacgat* 
ctcctttcac 



gttgccatgt 

ctcctgggcc| 

gatcattccg 

cagttcactc 

acgtcggaag 

ctggtctcga 

ttc'aattgtg 

cccaccttgg 

caacacctgg 



ctgttgtggg 
aggtgagtca 
gtggggctga 
ttggtagatt 
gataaaacgc 
gaattcttct 
aagtgtattg 
atggcctctt 
tac 



ccacaatcgc, 
cttgaatggt 
tgtggaactt 
ctgaggtctg 
ggtcctgttc 
ctcttttcag 
gggccttttc 
tcttgtccag 



cttgtccttc 
igcacttaatg 
gggt'gtagag 
catatggatc\ 
ctcaacgggg 
ct^gaccat: ( t 
ctctgggaca 
tgtcadcctg 



tgaattatga 
tggagctgag 
aacjgattccg 
ccagaaatga 
aattccagta 
ttttcattta 
gaacagttga 
ggact^gggca 



60 
120 
180 
240 
300 
360 
420 
480 
503 



<210> 155 
<211> 364 
<212> DNA 
<213> Homo sapien 



<400> 155 

actaaatata 

ctgagagaaa: 

attgccatca^ 

ttaaataact 

attattcttt 

aattctgaag. 

gcgt 



gaacacttaa 

agtaaatata 

tttcaatgca' 

ggafetttaaa 

gctatttaat 

caaagtattt 



caaatgccaa 
agaatttaaa 
catatatttt 
acattttccc 
aatg^/tacct 
cagcatttftt 



tcttttgctg 
gttcctttca 
taaaaaacaa 
cat ccacaca 

1 t ga a g a 1 1 1 
caaaa^gtctc 



^a g t g a a a al: t 
tacttgatca 
ttttctctct 
a aaaag.a t at 
"ctacataata 
taatatatct 



taacaattta 
tactataagc ' 
caaactcata 
gtgggttcta ! 
taaacattcc 
tttgtttgta 



>■ 60 
'120 
180 
240 
300 
360 
364 



<210> 
<211> 
<212> 
<213> 



156 
4 52 
DNA 
Homo 



sapxen 



<400> 156 

acatatatgt 

agataaccat 

cacctacaga 

ttctcttcat 

aaattaggtt 

gttccatgta 

aacataagta 

aattaattat 



atattatacc 
tctctaaatt 
ccctttgtgc 
ttgttcaaaa 
gctatattat 
ttagatgact 
atcaaiiaaag 
tttgtaaata 



aatagctagt 
cagtttttga 
acgtggcaaa 
aagagaagaa 
gattgtgcaa 
aagacattta 

ctaatagcat 



aatttcaaaa 
tgtttcaaga 
tcacctctga 
agctttaaag 
taagtattaa 
actcttaggg 
agac'ttttag 

at 



aaaacattga 
aacccaaaag 
aaggcaaaaa 
atatgcctat 
tttcattgaa 
atgttgaaag 
tatatactgc 



cttgagtgtt 

cc i^g t c 1 1 1: t J 

actaactgg'a 

aaataaaaga 

gtttgaccct 

cgcaccacaa 

ttattcaggt 



<210> 
<211> 
<212> 
<213> 



157 
224 
DNA 
Homo 



;^eo 

.120 
180 
24 0 
300 
360 
420 
452 



sapxen 



<400> 157 

acatgaacag caggctgttg cattgtaact tgtggctgtg cattaagatg ttgctgagga 
ttgcgaactc ctgcagcata tttatactgt ggaacggtgc ggacagcagg agtagctgca 
gcggctgcag ctgcaggacg tggacccatt gtctgtgttg atgtgttagc aacacgctgt 
gttgacatga ctcgtggaac ctgtgaagaa gctggtctca tagt 



' 60 
120 
180 
224 



<210> 
<211> 
<212> 
<213> 



158 
623.. 
DNA 
Homo 



sapxen 



<400> 158 

acacatttca ttatgctgcc ttttctctta tgattaaaac tttagccctc attcgaggtt 
tccaatggtt acttttagtg gaggagttcc ctagctttta aaaaaccact tttcctctaa 



.60 
120 
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gattccatta, 

acattcaatt 

acctagtaaa 

agacaagcac. 

aaaacaggaq. 

.-ccctctatat; 

; ttaataaaaa 
ccttaccccc 

"tcctcccccc 



<210> 
<211> 
<212> 
<213> 



159 
422 
DNA 
Homo 



tttattgaaa 

aaaaaaaaaa 

atgaaaaaat 

gttt;cttctt 

agg.ttgtttt 

agatt't'agcgic 

tacatatgtt^ 

aaactcatat 

cacccccatc 



gaagtctttc 
tcacgtattg 
ccagttttaa. 
ttggtctaat 
taatccagtc 
.ttatgctgac 
cattttaaaa 
attttt'agga . 
ata 



tagaaatgtt 

aacat.ctacc; 

oaacagtaac, 

ttattc'taaa. 

tacaaataaa 

ctcgttgtaa 

taattactg^, 

caagat.tttc 



aagcjaggatt 
"aagcatctgg. 
itcattctgc 
cgaagaagct 

q.aagacaatg 
aatctgtatt 
'ttttgcttgg 
ctgca'taacc; 



ttaaatgaac 
. a]c ; t c 1 1 c g g a 
'gg'gtatacag 
gggaactgac 
r cctgagttag' 
ta'actaaaaig 
ctat'cccacp 
acaacctgtc 



180 
240 
300 
360 
420 
480 
540 
600 
623 



sapien 



<400> 159 
aggtaccatc 
gtgggcaaca' 
cqtggttaajt;. 
a.tactagtgc 
tea taa teat 
-atatttccta, 
iat'tcatgatt 

gt 



t : tcttcagaa_ 
a"ctgcatcag 
' .aft cac c'c t a g 
pat. a a a a agtf 
~tctttt£aaa 
.gagaaaaaa't 
ggtgcttctt 



ctgcatctaa 
ccccaaggct 
gttctcccct 
caacatttca 
tacagcati^g 
^'ttgcttqac 
cagtgtgtct 



<210> 
<211> 
<212> 
<213> 



160 
393 
DNA 
Homo 



£ag 4 gctgtgc 
tccctccaga 
gtcccagttt 
a a tat a a aaa 
t ta t a t a tg t 
aaaattataa 
ctcattctct 



.t^P.tgggaa 
cp a a a a g g t <j 
taacartaata 
tt~attttata 
ttgaaacatit ajttaaaataa' 

tataaaacta 



tca.tacagct 
attcatggcc 
tttcatagaa 
caaatgtaat 



a a c a g a a g c a , 
cttagtgtag 



acagcatgaa 



60 
120 
180 
2!40 
300 
isO 
420 
422 



sapien 



,<220> 

. ; <?:221> mis cofeature 
<222> ( 1 ); .7. (.3 [9 3 ) 

n - A, T , C or 'G 



,<223> 



'■ -j ' j O ■„* 



<400> 160 

agctcactct 

tggatggtga 

tccttgagcc 

gtgtggttgc 

cccagataga 

ctacagatgc 

tgctttaagt 



<210> 
<211> 
<212> 
<213> 



161 
223 

Homo 



tttatctgtg 
cctcttttca 
cctctctaat 
ctccttcccc 
agagtctttg 
ctatctgctc 
attacacagt 



tggc'tgjattt 
ctttctctac 
ttaaatttga 
actccagtat 
acttttcttc 
atctttccag 
ttttgnattt 



ca^ttactgtt 
tccatgtctg 
tatta'ttaat 
ccactttcag 
agaggcttat 
ctggattagg 
atg 



t^ftgatttgg 
ggcatgaccc 
tatccaggta 
caaaacgtct 
tttagctaga 
tgttgcttag 



agctactcac 

agctttggac 

attgtcttcc 

tgcttcaagt. 

atgtttaaag 

atttgctagt. 



-60 
120 
180 
240 
300 
360 
393 



sapien 



<400> 161 '"• r ' ; ; 

accacttaat tactggcact gagtatcact gaatttctta gttttctagt ggggaaacat 
tattgagaag ccctccctta ttttaagtaa gttgattaaa tcttatgtga gttgccagtt 
gtaatttttc aaaggaaaaa ttttgatggg gtggaggaat gaattgccag ataatctttc 
tggaattccg agagaattcc aaagagggtt tttttttttt tag 



<210> 
X211> 
<212> 



162 
487 
DNA 



.60 
.120 
180 
223 
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*<213> Homo sapien 



<210> 163 • . - , ; _ ; 

<211> 500 \ / V " ".l . . 

<212> dna' : ; ' f ' ; . :\',\; ^ 
<2i3> Homo sapien' ,J ; J ;' 



60 



<400> 162 

acaagtctac attcccacta acagtgttta aacgttcctg cctctgcatt ctcgtcagca 

tttgttactg tcttttggta actgtcattc taacgggggt aagacaatct ctcattgtgg 120 

ttttgattct ctttagaacg aatatttctc ctcattcctc tactcttaat aatggatttt 180 

ctgaaaaaca tctattaatt ttatgcacta ttcaattcaa acaacttttt aaaagttgcc 2 40 

aaatctgtca caaaatatta aacaacaaga aaaatatcta aaggtaaact tgagaggggt 300 

gtaaaacaaa agactctgag agcgcactta gctgtaaaac aatcattcct attcctaaat 360 

tgagtgtttt tggttacatg ttctaagtgc cttacaataa accaggcaat g;tgctttatc 420 

tggagaaagg gagccctaac ttcaaagttt gagttcctcc aactttttta atagttaaat ^ 480 
ttcaagt 



487 



300 
360 
420 



<400> 163 

acactggatg cagccatgca tggatggttt ttctttattt ttcagtgatt tcctctgaag.; 60 

cagctgcact gatacatttg ggagttggtg gcttgacttt gtccataagg ggcgtggcca : ,3.2 0 

cttcacatga tggcgggcct ttaagagcac aaagaagttt aatatggaca aqaacaggaa 1,80 

aaagcaagaa gaaaacaagt agggaaaaac agctaacctg gagagaaaga atttctttaa 1 240 
cctttatgtt cttcattaaa aatcttatct tggactgatt tgagggattt ttagaaacat 
ggccttattt tatataagca ttaccttccc aggaatcttt gttgtatatt aatttttgat 
aaccatttga ttaactttaa aattaagtat atgtgtgtat atatacatat gtatgtttat 

atacacacat gtatctgtat agttttatat atacatatat acacatagac atacagagaa ; 480 

ccactacttt gtaatagtgt 500 

<210> 164 .. . . . ; . , , ^ , 3 •: :..:iJj.;o 

<2ii> 547 . : / M ; ;• . :> . t *m i-.h 

<:?13> Homo ^sapien f ^^^J . .; i ,;, i:i , n ; ilj ,.^r^v.c: . • -:, y' i 

<46o> 164 r - : : " : " " jr s * r ' ; : , - ' : ' E - .. :: " ' ;> f ■ ; ' . ' ~" ' . ■ \ V 

a ; ctgtaatgg gt'ttggbcaa;atatcatctt ;tgatgacct:c tcctaactca tcagcacctg 
catcagaatg gtcagtaaac caggtaaa^a agctctctgg ttcctcatgc tgcctcttcc 

tgctggcttt attctgcgtt tgactc'gaac gtttcgtcaa atcctttcca gatttccatt- 180 
tgatttcggt ggacttcgaa gatggatcac cactctcatt cagatgaaat tctttggaga - 240 

gaactttatt ttcaaagtaa ggattttcat caaaataaaa atctattctg taacctgatt 300 

taatatcttc aaattctgtc acttcaactc tggtcaaata atgcagtgcc tcttcatctt 360 

cctccccaag cagtgcagac acttgtggat ggttgacaaa tgttgttacc caaaaatttg 420 

ggattttggc gatcaattct gacctcttct gaaaaaatgg ttggcggagt ttgttatatt 480 

tctgttctac tttcaaaatc tcctcactgg cttgttcatt aagtctgtct atttcatttt 540 
gtacctg . ■ . i l ; l : . . ,^ 47 

<210> 165 ; V . . . 

<211> 400 ., ; . . : j . , 

<212> DNA 
<213> Homo sapien 



•60 
120 



60 



<400> 165 

acaaaactta caaagaagtc aaaagtctta acactcccat tctccaggaa ctcttgtctg 

tgtcatctgg taggagggag gaatcctggt tccctcaggt ccttgtcatg ttagcttttt 120 

gatagcttca atccactcgg ctcgctcggc cttgctgctg gcctgaatgt aatagtgtgt .180 

gtcatcctta gtaatcactt tgaagaggtt tccctggaca ttccctttaa ccccagtggg t .240 

aacgccatta tcttccagag; cagidacgag ' tgaaccacga agagaaaacc' cacccactgg 300 

cctgttctct tcttt'ggaag ggtcatagta atgcaggaaa gciggatcct tccttagaac 360 



OMCnnrin- — \aio niofiionA? I -» 
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aaagcgacgc accttccagt ttttcctctt gtgccctgct 



400 



<210> 166 
<211> 274 . . 
<212> DNA \. } ' 

<2l3> Homo 'sapien 



<220> 

<221> misc^feature 
<222> (1) <274) 
<223> n = A,T,C or G 



> J ^ '-.> 



<400> 166 

ggtaccttca tataataaag ttaacaaaaa taataaaata ttaaaaaaaa gagccagctg. 

gcactgccaa ccaattccta tagtagcctt agaaatccta atcctgtaga atttcctctt 

gtagtcaata agcaccaccn tcttcaggag tatttcagtg tattgttatc tacaccaagc 

aagcctggtg atgcagctac ctgagttctc ttggttatgg gtgaatgtta tqtt:cattca 
taacttcccn gctttcatgt aggtggggat agag 



t <210> 167 

. <211> 478 :[': 

<212> DNA * " . ; 

,.<213> Homo 3ap;Leri 



rj- 



t 7 ■ T + 



1 - 



' ' 3 ;'i " ;j > \~ ;.i 



<220> ; _ " . " 

<221> misc_f ea.ture 
<222> (1) . v: ;<4t8!) " ; 
-<2'23> n = &,T f C or G 

<400> 167 

ctttttaaaa tccaatatat tctgccaaga atatgccttg atagttagcc ctcagcccat 

aggtgttttt tgttttttaa cagaattata tatgtctggg ggtgaaaaaa cccttgcatt 

ccaaaggtcc atactggtta cttggtttca ttgccaccac ttagtggatg ttqagtttag, 

aaccattttg tctgctccct ctggaagcct tgcgcagagc ttactttgta attgttggag" 

aataactgct gaatttttag ctgctttgag ttgattcgca ccactgcacc acaactcaat 

atgaaaacta ttt.aacttat ttattatctt^ gijgaaaagna tacaatgaaa attttgjntca 

tactgnattt atcaagtatg atgaa.aagca " atagaiiatat attcttttat . tatggtaaaa 

tatgantgnc attai:;taatc ggccaaatgg ggagrjiggatg htcttt.tcca ghaatata 



<210> 
<211> 
<212> 
<213> 



168 
213 
DNA 

Homo 



t - ... . 



sapien 



J. 



'." O Z s .' ■ 



<4oo> i69 5 ' " . ; .*: ; \"'Z\. ;' ' 

actggctgcg aggcgqcagt . cgatcaatgt' . a;tgacag|gag: ' ctgagacttg ' gccacaccag' ( 
gatcccccat c^gacagatg. ttgatgttg^'\c(ccgga^tt:1^ catgcctcga r gg'agac.tggt 



, 60 
120 
180 
£ 4 6 
274 



- r.' >- 



60 
12Q 
186 
"246 
3.00 

420 
478 



<400> 168 

acaaatgtaa cagtaatgat aaattctctt ttccaaggga aagagaaacg ctgcagaatg 60 

gacattaaac aaggcattat gccctacaag caagacataa aatgtctaag ggaaacttca 120 

gcataaaaat gttgaacaca taatgtgaga taatttgaat- aaataacaac tgacattctt r 180 

tttttaaaaa aaaagtataa aaaatagatg tgt 213 

<210> 169 

<211> 341 . s - 

^212> DNA : 
<213> Homo" sapien; 



, . 1 i 



' 60 
120 
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0 ^ttsfoi/l8577 



ccacaccccc gactagcagg agcagcagtg ccttcttcac atcttcatgc ccgtatattt 
ctggggcgat tgaagctgcc agcttttcgt agaaat.cctc ctctgcaatt tgq.c.tcagct 
cctccctggt gagqj:ctcca gccccagact, catcatcctc actcttgttc ktcttcacaa 
tccgatgggc ttccaggtag gtttctgaga/gtaaaccctg t , t 



180 
2.40 
300 
341 



<210> 
<211> 
<212> 
<213> 



170 
543 
DNA 
Homo 



sapien 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(1) . (543) 
n ~ A,T,C.or G 



<400> 170 ; 
accaatgatc 
aaattttaat 
aagagaactg 
tacattaggg 
aaatgggctc 
aataggttta 
actttctttt 
tccagctttc 
agggaagttt 
agt 



atgcttccat. 

cttataatat 

aagtagaaaa 

gtttggggga, 

ttctaatgag 

aatgactagt 

aatgaccaac 

tcacctgaaa 

ccagcagatt 



tttttttagt, 

agaaatctta 

tagaaatgcc 

atgctaatta 

gtcactactg' 

caaatgaatt 

cttaggtaaa 

atctattgaa 

tcaggctgnt 



t.ttaaaccac 
tgtaaa^gaa 
agtaaacaac 
.tataiitgaga 
aacaaaattg 
attttcttct 
acaaaaatat 
gtgatccctg 
cttaaaggtt 



caaaccaata. 
attttgtcat. 
ataatgttta 
,atatacatta 
ttbcctcttc 
tgttaaataa 
tgtaatccta 
gtcatcctaa 
ttggtggnca 



tttttccttt, 

gtttcaaata 

atttacaact* 

gaactctt;ca, 

tgttaaatag 

attaaatctt - 

gaaattatcc 

taatgggatg 

ttttctcaat 



: " 60 

. 120 
180 

240 
300 
,,360 
4^0 
480 
540 
543 



<210> 171; 

<211> 280 ; 
<212> DNA . 
<213> Homo; 

<400> 171 c 
acatactaaa 
ctactaaaaa 
gattctccat 
gatttcgctg 
gcgcttggat 



sapien 



aatatttaaa 
aataattaca 
catcctccac 
ctccatttag 
aggtcatgtt 



i.r 



"T 



atagagaata ttcctcacag aggacttttt tctt^aatta 
aagtccaaac aggcagagag atttagcaca ctgatcacac 
gcttgctctg aagagggttt aaaaagtcca gtttctcgtt 
ccaaggttgg cctggccact gattggcaca agtgggtaat 
tgtgtcttgg aaatttgggt < 



60 
120 
180 
240 
280 



<210> 172 
<211> 463 
<212> DNA 
<213> Homo:, sapien 



<400> 172 

caggtactat 

gcataccagg 

gqatcacttg 

ctgtgtgcat 

ccttcatgca 

aaacctggat 

ttgagtgtcc 

tgtttggtta 



- 1- 



ttaccctatt 
gttggcaaga 
gaaaaaatcc 
ctcctttcag 
gagctggggg 
ccacctgttt 
cttgttctga 
gcaaccttag 



aataagttcg 
ataatctact 
aacctctcca 
aagttcaacg 
tggtgttcca 
aggagttctt 
aagaaatgta 
ctttctctgc 



gtctctgctfo 
ggtttgggca 
gatgctaaat 
gtatccccgg 
gaatagttcc 
cgcagtccat 
aaaattggca 
agagtggtaa 



gcaatctttc 
cacatgggca 
ttctgccctg 
cctggagctg 
taatgacctg 
tggtccgttt 
ttgtcagtgt 
aac 



cattgctcca 

aggcttgact 

ccaaaacaga., 

taaagggggt , 

catctttggt 

ctctggtagt 

aaagttattt 



60 
,120 
180 

: 240 
300 
s 360 
. 420 
4 63 



<210> 173 
<211> 165 
<212> DNA _ 
<213> Homo 



sapien 



PCT/US01/18577 



48 



<400> 173 _ ■'; ' 

acccaaagaa ctggtggcct J caggccacaa aaaggaaacc caaaagggaa agagaaagtg 

agaagaaabt" gaagatggac' tctattatgt gaagtagtaa- tgttcagaaa crtgattattt 

ggatcagaaa ccattgaaac tgcttcaaga attgtatctt taagt : r. ; ^ ' : 



60 
120 
165 



*<210> 
<211> 
<212> 
<213> 



174 
532 
DNA 
Homo 



sapien 



i. 1 



<400> 174 

actccatctc tttgactgaa taggtcattg atcctatcaa gggataacaa tcftttttgcc 

actggatgtt gatgttccta tccaaatcca cagcaagctg gtgttgcaat tttbeagatt 

catgcagatc cactgacttc agtgtgttga tactggcttt gaagtattcc atccactijgc 

ggatcgtgga atctcccatt aggtatatga gttttcctct caggcattcc ttcattttga 

ctgtagccaa actacaggag acaggattcc atgtgtttct ccagacatgc ccactgggga 

1 1 g t g ga t g t cattcca a ad tttfc alrh t ct : ctt teat t gc - a a c t g 1 1 1 c t : 1 1 1 "if 1 1 g e at t 

t q q a q a cab t a a t tib t a tt ir a'a tt tt?t cca t £a t c t ctac ? a be c ac a t tt *qacc 1 1 1 ca a 



tggag; 
agaggctctt 
t'gagtgcagb 



; <2io> 
<2ii> 

<212> 
<T213> 



175 
374 
DNA ' 
Homo 



t tc 1 1 gft 1 1 g dt a aga't aa g a'dactt t c 1 1 g tt: b 1 1 ag aa taba t gt ga g 
ac a : g%gc d t g 1 : tg ttga'g g be-' 1 tb:a c^'a bag t a- gaaefbet t bt ; tg~ - : ; ? '-' "r . 



> j j 



1 ■:.».-+ 



i > . 



.1 c 



sapien 



V I. 



t - 



60 
-120 
180 
240 
300 
360 
420 
480 
532 



<400> 175 

taatcacctg actgagctcc aattaactga ggagaaaegg ggtggaggag agggctggtt 

gctattcaga cttgataatg agattgatct gtcccatgga gagtgaaagt tcagttccao 

ttctgcctcc ttctttccat gctgtcctca tgctctttat cctcacttcc tcagtccctt' 

caacactcaa aatctgattt tatttctctc teacaegtat caggggcagt ttctgaagtt 

gctgaggttg aattttcttc acaaacctct ataaaacatc agcagagaac atataaatac 

attttgatta gcatacattg caaaatttct cccacaatgt caggggatga aagcaggtgdp 
tccccactga : gagt ' 



<210> 
<211> 
<212> 
<213> 



176 
428 
DNA 
Homo 



■J : f 



<5 i £/ 



i) It 



■ „ ..> • 



j ... 



sapien 



60 
120 
180 
240 
300 
360 
374 



<400> 176 

actgeaactg ccagaacttg gtattgtagc tgctgcccgc tgactagcag ctggactgat 

tttgaataaa aatgaaagca ttaaagggtt tccctacaaa acatttttct ttaaaatact 

tttgaaatgg ctataagcag ttgactttca cccttggaga gcatcacact gtgtgaggtt 

cagtgattgt tgaccctccc cagcccctcc tgcttcttta agttatctgt gtgcgtgcge 

ttcctctcaa^tcttctttgc aogctcattt ctttttctct gaccbatgag aaaggaaaac 
ttactgatga* 1 taatttttaa atagtgtaat t-tattcattt T atagcaftgto -agfgataaatt 

aaaagaacat ttgtctggaa atgetgeegg- gag c eta ft g tgtaaatgta'g'gtattttgt 
aaaataac *" > •■- ■ ■} •■• ■ '■ :r - : e 



<210> 177 ' - * f 
<211> 318 ;; 
^212> DNA 
<213> Homo sapien 



'jr "T S .1.). . . 



f : 'J 



60 
120 
180 
240 

,.•300 
-360 
'420 
•'428 



<400> 177 

acctgaacga agtcgeggge aagcatggcg tgggcegtat tgacatcgtg gagaaccget 



60 
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tcattggaat' gaagtcccga ggtatctacg agaccccagc aggcaccatc ctttaccatg 
ctcatttaga catcgaggcc 1 ttcaccatgg accgggaagfc gcacaaaatc aaacaaggcc 
tgggcttgaa atttgctgag ctggtgtata 'ccggtttctg gcacagccct gagtgtgaat 
ttgtccgcca ctacatcgcc aagtcccagg agcgagtgga agggaaagtg eaggtgtccg" 
tcctcagggg ccaggtgt 



120 
180 
240 
300 
318 



<210> 
<211> 
<212> 
<213> 



178 
431 
DNA 
Homo 



sapien 



<400> 178 

acttgaggct 

tctcctagac 

agagaggcag 

ttgcagcgtt 

ttgaggctct 

gattggaafcc 

gcaatccatt ' 

cttcattggg 



tttttgtttt 
ttgacctaga 
gattggggtc 
ctcgaagtgg 
gctagcaagc 
tactttgtca 
taactcataa 



aattgagaaa 
atattacata 
acagccgctt 
ctgtaggact 
caccagtgag 
tttatccacc 
acatattact:^ 



agactttgca 
ttcctccagt 
cttcagcatg 
cgaatttaca 
gctattgggt 
acagtgacaa 
ctgagtaadt 



attttttttt 
aagtaatact 
gaccaagtgg 
gaaagccaca 
aaccaccttt 
aggaa a ag t g 
ggc cage cat 



aggatgagee 

gaagagcaaa 

gccttgggga 

gaggtgeaac J 

ctatacag^a 

gtgccgttat ; 

teateggate 



60 
120 
180 
240 
300 
360 
420 
431 



<210> 
<211> 
<212> 
<213> 



179 
323 
DNA 
Homo 



sapien 



<400> 179 

actgcccact 

accatcatta' 

gctcagatta 

tgctggcccc 

gcggcttctc 

a&gcagtgac 



tttacacaag 
acataataag ; 
atggaagtgc 
gacacagctc 
atgcacagtt 
actaactgcc 



ctgeagcaga 
gacctcagaa 
tgggcacatg 
cagttccacc 
actgtccctc 
£gt " '•• '' 



actcagttct 
tccaaccttg 
ccacctcctg 
catgacatct 
tctggagggt 



actgeaggtg 
ccaaagaatt 
ccattgtcac 
ggctgaggag 
cctttaatgg 



"J 7> 1 'J' 



agagtattger 
caactcctag 
agttcagctg 
gcttatggga 
; ggactgt gca 



60 
120 
1'80 
240 
30t) 
323 



<210> 
<211> 
<212> 
<213> 



180 
409 
DNA 
Homo 



sapien 



<400> 180 

actgtgttcc 

atgcttcaag 

aatacctttc 

agagctactc 

tttggaagtg 

ttggtctgct 

gatatctcca 



tttgcatgtt 
taagecttea 
ttggcttcca 
tgaggggtca 
tcactgggca 
ctcttttctg 
aatccatcag 



tcttctttaa 
tetgetttaa 
caccagaggc 
catttgettt 
egteggctat 
taattcjagag 
gcacatattift 



agaatttagc 
gtcttctatc 
cagggcagcc 
ggcggtgttg 
gtcactagtt 
aaatttcttt 
tttaagaaca 



tccttctgct 
cttacttgag 
gtggtggttg 
gcctttcctt 
tcaatgccca 
ggatctgata 
atattgeaa 



gtttctttaa 
ggataagttc 
gtctgagctc 
V tcttgtcatt 
tagctctcat 
aagcatccac 



60 
120 
180 
240 
300 
360 
409 



<210> 
<211> 
<212> 
<213> 



181 
460 
DNA 
Homo 



sapien 



<400> 181 

acaaagattg gtagctttta 
accacaacaa tattccaaat 
ccaactaaaa aa&atattija 
aaaaacagtg tgaaatcaca 



tattttttta aaaatgetat actaagagaa aaaacaaaag 60 
tataggttga gagaatgtaa ctatgaagaa agtattctaa - 1 <120 
aaccactttt gattgaagca aaatgaataa tgctagattt ■ '.' ■ 180 
ctttggtctg taaacatatt tagctttget tttcattcag 240 
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300 



atgtatacat.aaacttattt aaaatgtcat ttaagtgaac cattccaagg cataataaaa 
aaagaggtag. e.aaatgaaaa ttaaagcatt tattttgg.ta gttcttcaat .aatgatgcga 360 
gaaactgaat .tccatccagt agaagcatct ccttttgggt ,aatctgaaca aggccaaccc 420 
agatagcaac. atceftaate -pagcaccaat tccttccaaa 



460 



<210> 182 
<211> 232 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1)...(232) 
<223> n = A, TyC - or 



,-<4;po> 182 t 7 v v .. r . , ..." ; ; _' : " " ' ! *'■ - 

actgacagat : taatggcttg^ ccta^a^ctg tgcaagaaac , agcct gccag ncjtgtcattg " $o 

nnagggacca^ .gggcaaaapc ■ aagagctgtt: ^ttx;cpagaa .gagcgctge^ , aaqaca^tgg. " : i20 

tkcgtgcttc^ctt.tac^ ' ■ 130 

taatacactt ttgctttatt ctcacatgcc attcaccaga ttatttgatg gt " ' \ ^232 



<210> 183 
<211> 383 
<212> DNA 
<213> Homo sapien 

<400> 183 



60 



^tgttattta..aaaga > tgaaa...t$tcatggtt ca.aatg=tatt tttctcccat aaaaatattt 

:tptcttccat ttaaafatat,,aqc.taatctt ■ tgagaaatct .tgcacaaatg gcattttatt 120 

raaagaaaatc taajzttacaa , agctttgtaa ,ajttttgagaa \aaacattcat agatcataaa > 180 

^aaattftc aatatgcaat .atjica^attt: acaagaaaat aagcacaaac tttfcagacag v' V 240 

(t^agttatt.g^tgcactpc .rttaattcct ccaaaaaatg 'laggcaaacc 300 
ctaaaaatgt agcagaagca tttccgcaca ctggtgtcca gaatctagtt .tgbgcagaaa 360 



tgtttccact agatttatag agt 

<210> 184 
<211> 444 
<212> DNA 
<213> Homo sapien 

<400> 184- 



383 



acagacacaa;acatataaat atatgtatgc acatatttgt catacatttt caataaatga 6b 

tatctttatt attgtttaat gacctttttt ctcttgtgaa ttttgacata aagtatattt" 120 

tataaaataa. gagagttgtt, gaettacgat - g^attttgta taatacaatt . ttgatctctt 180 

ctgctctcat ttggttgatg. ; tttgcctaaa, atgtcttctt ccacttgcca, ctttcaggct 240 

gatttcacta ctagatctca. agtgactctt gaagagaggc aagttggatc ttggtatata *' 300 

aaattttata taatccctct attcaatgta. tgtgtattga ttggcaagtc . tatttttaaa ' 360 
atatttattt tctgaagaca aagattactg ttattttatt gtttaatgat tcttgtaggt 420 

ctgtttctca ttctatcttc cttt yy AAA 

444 

<210> 185 " <\i\.- 
<211> 289 

<212> DNA ■ , ■:. 

<213> Homo sapien 



<4Q0> 185 , 

acttgtgaea ggcagaegtg>ttgpagc^ aagtccacct '" /' * 60 

gggcatctcc >attggcgtcc acrgtccttga gcaatttatc; .cacggcatcc ctcjtpttttc' 120 
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cactctgcag gaagcctggt agctccttct ccatcagcac cttgagctcc cccttggtca; 
gggtctgcgt gctgecctcg ctgcccgaat, atcgggaaa_a gacgtctatg atcatgccca 
tgactgtctc tagttccgtc atggtgctag attcagaccc accttcctc , 



<210> 186 
<211> 407 
<212> DNA 
<213> Homo sapien 



<210> 187 - 

<211> 441: - - 

<212> DNA- - 

<213> Homo sapien 



<i210> 189 
<211> 225 
<212> DNA 
<213> Homo sapien 



180 
240 
289 



<400> 186 . ^ . 

acagacaaaa tgctcaggat gccatgattg ccctagagca tggatcacct tccpagcaat 
cggtttctgg caggatgcac aatggccctt gggcactgtg gcaatgccaa ggtcctgcaa 
ttcctgctcc agacccccaa gcattgagtc cagggaggcc ttgtgatcct gcttgtctgg , 
taagtgcttc ttgccagcat ctgctctcac tgcaaccttg gcctgcatct cagtcaggtg 
agccatgagc tcatccaact gagcagctgc tgacgtttta gaaggtggtg gtgattcctt , .300 
tggctcttgg gcttcactgt agacattgag ctcctggata ttggtagtat acacgagctg ; .3^0 
cgecggcaag ggacttgtgt/ tatcctgaat .agaaaggatc tccgaag - ; 



.60 
120 
180 
240 
300 



60 
120 
180 
240 



44-1 



<400> 187 - ' ' ' 

actgcaagac ccatcttccc tccagttaat acactcccag gatgggctgc agagggggag 
actctgagag aagctggagg cccacaaaag tccactgacc ctctttctgt cccagaaatg 
aataaaggac ccagttgtgc tttccttcca aaatcctcaa caaagttgtt tgtgctccaa., 
gaaaatgtgg gaataaaaaa atcatgtccc aggtcatctt tgtgtgtgtg cgggggaggt 
ggatgggagg aaaaggcatg tattaataga tactgctgct ataaaatgac ataaatcata , .-300 
gcccttgatc tgtttctgta aacaatgcca gcttcttcag gttattggca actaccccta 360 
atatacctag cccagatcct ttcataaagt caagtgctat atttccaaaa taatcctatg .A20 
aaatcatgaa gcfttgtgaag g , l o:^- ■:. - :i J.' " "V^ = ~ ;r ^;- 

j ■: v.- i <: ■. ■.. . ^z: r- i- r. ZiiizJ :■ J -oo.t -:: -v ■ ,v; ; - . - •. , , :• r 

<210> 188- -<,.-.- J J:: -' • v t .•' 1 . ;co , \V : ' ; r,, ■ 1 * - / ' ~ ? 

*211> 323-- - - ' " -'v-:- . i - - ; • ' ' ' ' ^ f ^ - r ; " : : ^ ; 

<212> DNA- • ! f , ; ' ■ ■ ■■ ■ ■ ^ ' ' ' j 

<213> Homo sapien ' ^ - , - ■•'.-> ' • - ' ■ 1 

<220> 

<221> misc_feature 
<222> <1) . * . (323) 
<223> n = A/T,C or G 



60 



<400> 188 ; " 

acttagaaaa cagtccctgt ccatcagcca gaaaaggtga ccatcacccc taaagtaatt 
tccaaacttt agttcagtgg gaaagatatg ctggtagtgc atattcagng ntgattttca ; 120 
gtgctagtaa ccacttttaa tgccagaaat atgtaacaat gataatgtaa cgtcaaagtg 
gttactaaag attartagcct taactttttt atgnaaaaga taaaatccat .tcctcctccc 
agtgagcaag catggctt^c atttctcaaa a'atgagaact - tccatggcag ccaagaaaac 
gtcttctcag . .aggaac'tt'tc^ gtt L - ^ 1 1 



180 
24.0 
300 
323 



<400> 189 ' ' 

caggtactcc ctgatctttt cctcagtggc ttcaggattc agacccccaa cgaagatttt 60 



52 
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cttcaccggg tccttcttca tagccatggt atttttaggg tcaatgacac ggccatccag 
cctgtgctcc fctfctggtcta ggaccttctcr cacactggct gcatctttga acaggataaa 
cccaaaccct cttgaccgtc cagtgttggg atccattttt attgt ;■- ■ . . , 



<210> 
<211> 
<212> 
<213> 



190 
501 
DNA 
Homo 



sapien 



<220> 

<221> mis cofeature 
<222> (1) . . . (501) 
'<223> n « A,T: r C or G 



<400> 190 • • . • ;•• - ij- fc - ;-. : , -v-. 

acagctgaag ttngataa^a -^agaaatata tataajgacaa: aaatagacaa. nagttaacaa, 
t-aaaaacaca actatctgtt gacataacat ■■ aatggaaactt tttgtcagaa a^ctacatjct 
tcttaatctg attgtccaaa tcattaaaat atggatgatt cattgccatt ttgccagaaa 
ttcgtttggc tggatcatac attaacattt tcnagagcaa atccaagcca ttttcatcca 
agtttttgac atgggatgct aggcttcctg gnttccattt gggaaatgta ttcttatagn. 
cctgtaaaga ttccacttct ggccacactt cattattggg agtgcccaaa gctctgaaaa/, 
, tcctgaagag ttgatcaatt tctgaatccc catggaaaag tggtttctta gttgctagtfc 
cagcaaatat ggtgcctata ctccaaatgt caactggagt tgagtaacga gctgacccca 
gcaatacttc tggagatctg t 



<210> 
<211> 
<212> 
<213> 



191 : 
436 
DNA 
Homo 



sapien 



; y^00> 191 . .■■ , 5 ,.. D: 
/acagtgcatg gtgctgtcac ttggaaagcc 
aatatgaaac atgcagaccc tcctttataa 



tttcaatgtt gtcttcaga-fc tgttgtgatg 
agaaaaagac cttaaaactt gaa^tatgaga 



taattttaca ttttaaaagt ttatttgatt ttcatattat tcactttcaa agccctttoav" 

aatagaaaag gtatgaactt ttggggggat aatttatgta tcgtaaactt attagaacaar •:. 

aatattcctg atgtataatg agttgtttta tttatacaac tttttcaatg gtagtttgca«* 

ctattcttta ttatgctaca ggtttattta ttatgaaaca aaggaatatg tatjtttatgt *- 

attttaccat gcataggtta actctttgcc acagatttat tggttcttga tacacctaaa 
ataaaaaaaa atgtgt 



<210> 
<211> 
<212> 
<213> 



192 
319 
DNA 
Homo 



sapien 



<400> 192 . . 

ccagcgacag actttgcaaa catgcagatg, gttctcacat gtcttccttg .tctcattttc 
agggcacgtg.tcctaggttc tttcgattac gtctctcaatf g'caaggttfcc ; cagatctcto- 
t-gtatcctfca.. cgcttccctt . ttggatgcac cttaatttta aaatacctct ; "t^ tctcatt 
aattagatca cttcaagtta aatacaaaac atggcaagat ggat-ttaaa*? ttagagggat 
ataagtatac ataagagaag accaatctct acttttaaaa atgcagttaa ttaacaataa 
agtaaaatat agtgaaggt 



<210> 
<211> 
<212> 
<213> 



193 
586 
DNA 
Homo 



120 
180 

225 



60 
120 
180 
240 
300 
360 
420 
480 
501 



-6Q 
120 
180 
240 
300 
360 
420 
436 



60 
120 
180 
240 
300 
319 



sapien 
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<400> 193 

acaagaggcc atttgtcttg cctttttctg acatgtgcat actataaaat cacaggtagc 
caacatttag tatcagtaaa aaacaacta.c, gtttgttcac ctgtttggca otagggagaaa 
acaatgtatc tcatagcatt aaatgataca gccttaacac atatgatgct catatttgca 
aagttcccaa atgttgagaa gttctagtga aaag.tcatacj tattgtgcaa agatgaaaat 
ttggggccaa tgtctgtatt icaaaataacc ( aaaatatatt ttaaagcaaa atatatcctg 
atactactat agattctagg aattgtccta aaagagtaaargtgttgtttc ctttctgaac 
atgaataaca tcaaaggaag: aacccagttc^ttaagactta agtaggaaat ttatagaaat 
ttgatttata ccagtagtaa, taacattc.at .aaggaaaaac tattaggtaa caattttctc 
caagaagagg atcagattac ttaaaattgt tggagaattc tggttgtttg cgcaataatc 
atagtgattt >acattgcttt tcttct^ttca gagcaataag. aaagtt 



60 
120 
180 
240 
300 
360 
420 
480 
5 l 40 
586 



<210> 
<211> 
<212> 
<213> 



194 
214 
DNA 
Homo 



sapien 



<400> 194 4 ' : 

acatttttat aactggaatg tttatgtgta gtgaagctct gagaggactt tgcattagat 
ctcagcagca taatcagaag gttgtccttt gtctcagcaa tttttaagct aatagtagca 
gaaattgcag tggaaataga ctgctttgcc acaacattca gaaaatcatt tatctttttaJ n 
ttgcagttct tgtcaccaaa caatacattt tagt * ' ■ * " 



60 
120 
180 
214 



<210> 
<211> 
<212> 
<213> 



195 

325 
DNA . 
Homo 



sapien 



<400> 195 = ■■ /v • ' > ; ' " ' •" } 

actgtacata tttgcaatca cattgtgcat agattcttaa tggtagatat-gatttctttt: 
gtcaggctac aacaatgaac tgcagattcc. ttgtttgtaa tgtaaatg.at tgaatacatt, 
ttgttaatat .gtttttattc ctatgttttg-r ctattaaaaat.titfeataaca tttjpcaagac 
aaaaattcca agttta-tget j t.tgaagaatt, tatgfcaatta- aaatttcact aaactaatct- 
ttttagttta rgSaa^fcattt: ; gggttttgac., actggaagfet-: gcgccaaata; agcatcagaa 
ataggagatg cttaacattg ctata 



180 
240 
300 
325 



<210> 
<211> 
<212> 
<213> 



196 
382 
DNA 
Homo 



sapien 



<400> 196 

actccttccc agttttttet ttataptgag ccttcaggga cagtaagcat tctacagctt 
catttatttt agccttaggg gatttttcag cttttagctt- acgaaccacc tccccttgtg 
cagcaacttc atcatacaga gatttacttt ccagaatact tgctgaggaa ttagaagaaa 
tattctgtcc tatttcagca ggagggtttc caggtttata ttcctggcca gttttctcct 
tatattcagc tttcaaagac aaaagctgtt ttacagctgc atctacatct tcctttggtg 
ctttcttggc ttttaattca cgaaccacat ctccttgaac agccactcta ttgtaaagga 
ccaaggaatc ctcagatgta gt 



60 
120 
180 
240 
.300 
360 
382 



<210> 
<211> 
<212> 
<213> 



197 
648 
DNA 
Homo 



sapien 



<220> 

<221> misc_feature 
<222> (1)...(648) 



WO 01/96390 PCT/US01/18577 
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<223> n = A,T,C or G 



<400> 197 - - - . . 

acatccacat l; gttcetlceaa c ,atgacgtttg gggtcctgct r -tgccaacabfc btttattgcc 
agctgttcag gtgtoatctt 'abcttcttct teta'cagcct ta.ttgtaatt cttg^otaat 
tccaacatct £cttttaccac tgattcattg/ ^tgtttacaat gttcaCtgta gtcotgaagt 
gtcaaacctt ccatccaaet cttcttatgc aaatttagca acafccttctg ttccagttca 
tttttccgat agttaatagt aatggagfcaa taafcgtctgt ttagftccatg aattaatgcc 
tggatagatg gcttgtttaa gtgaeccaga ttcgaagttg ;tttgtcttgg> tteatgtcct 
aagaccatca . tattagcatt gafccaatcfcg aaggcatcaa taacaacctt -tccttttaca 
ictctgaatgg gatccacaac cactgccaca gcfcctctceg' acaaggcttc- aaagctctg*c 
tgagtgttga tatccacacc agaaagccaa caaccaaagc bagggt%adt- gtgataccaa' 
ccaacaacca tctccggcct tcctgtctgc ttcaacatat ccaacatttt aacttggaac 
actggatcaa ctgccttcac actgacacct ggtnctgatg nggcatag 



<210> 
<211> 
<212> 
<213> 



198 
546 
DNA 
Homo 



sapien 



/ r * -/ 



'2 .1 



■<220> -r: : . . ~ 
<221> miscrjj&eabure ; > 
<222> (1) ... (546) 
<223> n — A,T,C or G 



60 
120 
180 
240 
300 
360 
420 
480 
-540 
600 
648 



<400> 198 

acaatacagc accactactg agaagggctc gaggttttgc aatccaaggt tctgacttaa 
agcaaaaata cacggcatag attgcaacag caaagaagtg tccaattaaa actagagggt 
taggagacaa tacagaaagc agcccaacag gacccgcaac acattcgcca ccaagtttga 
aataaagaaa acaggctttt cttagttgat gcagggaatc atctgtggca gaaaataatt 
ea t a a ag agjc ,c t ga gca a gg a t at t c a eg a : \ca a a g ga a t g agatgtttttcttgcccagt. 
aaaatgattt -tttggcctcg: ^aaaabagctgj catcatcata' aaggtcaggg atacccttta 
go a g 1 1 1 1 ct > ,c c a.ta g t fcfcb ; zafcat ctt taae aaxfcaacagt' c a tt c c t ccaicc^gtaagtg 
gatgcctcat ; aitatatgcg ±otic<:c;aaaa: gaagaaeace- i^pgfattcttc,: actgatgaag 
gaggaaggaa >gcttgctgca tggacctcagt_ atgagaattg cagtggittti:t_; aagaabggtc^ 



60 A 
12 0 
180 

300" 
360 
42 0"». 
480 -M 
S4'0*~ 



<210> 
<211> 
<212> 
<213> 



199 
275 
DNA 
Homo 



sapien 



: it : 



<400> 199 

actatgtgta actttggcaa caggtjtgcag; tcagccaggg tgagctcgtt gccatccaaa 
aacttcctct gagagacacc ttcatcttca gcactggttt . catccacttc ttctgggagg 
ggggatgtta agtaattgtc taaaaccttc agggctttca ggagtccctt ctccagattg 
tcattgagtg ctgggtttga, attcttgatg .taggcagaaa atttggcaaa tatgtccagc 
ccagctgtgt tggactcagg; gttcagagot gccag " -' ; ; r ( : . , < ■: ■*, ■ 



<210> 200 
<211> 423 
<212> DNA 
<213> Homo sapien 



. J tr. 



60 
120 
180 
240 

-275 



<220> 
<221> 
<222> 
<223> 



mi s c_feature 
(1) . . . (423) 
n = A,T,C or 
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<400> 200 - ; ; : . ( - ( ( , * 

cctgagaaat tctnaaaagt acgatgataa ggttgcaaaa atgaagaagc tcatcatact 
aaaactagga aacatacnga tccataacan gacatgcnaa gcaaagttcc caaagtcaca 
gacaagaaga gaatctcaaa tgctggaaaa tacataatta tggttgcatg atntaaccag 
tgactctttc aacataaacc ttgcaggcca gaaggaaatt gcgtgctata gttgaggtgc 
caagcgaaaa atagcttcta tgtaagaata acataaccag caaaactgtg ctacaaaaat 
gaagaaaaag caaagacctc taaagataac caaacgtgga aaaattatat caacactaca 
tgtgccatac aaaaaatgct gagaagagtc ctcctattaa aactatatga tgctaaaaaa 
caa 



60 
120 
180 
240 
300 
360 
4 ; 20 
^23 



<210> 
<211> 
<212> 
<213> 



201 
560 
DNA 
Homo 



sapien 



<220> ^ 
<221> mis cofeature 
<222> (1) . * > (560) 
<223> n = A,T,.G or G 



) - 1 j . 



<400> 201 

acaatcgagt attttagaaa ttacatgaaa catgaaacag tttttgcaat tttttttaaa 

ctgggcatct ggtttctaaa aatttatttg aaacaatcta gaattttctt ggtgcaaagt 

gtatcatgtg gaatatcctc atatttttac catattttaa gaactttaag acgattaatt 

gtaaataatt tatttgattg gtgcagttct aatccctaaa tcataatctt aaaatcagga, 

atgtgtggag aacagagcca tgtcatatca ctttgctctt accattcctt ttgatcagcc 

tcaattcagc ctcattgtgt agtatgtttt ttctttctat gaaaaacaac agaaagcatt 

tcattttatt tgcctatgtt caaatatgtt taataatgac caaagtgcat tctgagtttt 

ttcaaggaat ,gtaatactgg agctttaaga acatacttag tjttctcatgt, gaaaacttan 

gctttgtctg angttttcct tcctctattg nctaa.tggtg aggtggtttt aggaa1:tatg 

ttttataact ,tttcaatata : , . > : - ; ^- 5 : 



•>v 



<210> 2Q2 7 VO<V i-i;^.:! ^O: 

<211> 36^;-n- rv!;r,;':; <?- 

<212> DNA , : .-, : r 

<213> Homo sapien, , 



■ > V " .1:'; 

r ■; ■ , r. p 



y j v p s. sv- J 



t:. cv ™ 



is b 

.120 
A 80 
240 
300 
360 
420 
:480 
5 } 4X) 
5 J 60 



<400> 202 

acgagcccca cagagcagga agccgatgtg actgcatcat e atatttaaca atgacaagat 
gttccggcgt ttatttctgc gttgggtttt cccttgcctt atgggctgaa gtgttctcta 
gaatccagca ggtcacactg ggggcttcag gtgacgattt agctgtggct ccctcctcct 
gtcctccccc gcaccccctc ccttctggga aacaagaaga gtaaacagga aacctacttt 
ttatgtgcta tgcaaaatag acatctttaa catagtcctg ttactatggt aacactttgc 
tttctgaatt ggaagggaaa aaaaatgtag cgacagcatt ttaaggttct cagacctcca 
gtgagt • r\ ■ 



60 
120 
180 
240 
300 
360 
366 



<210> 203- •.; v.:: 

<211> 409 r - 

<212> DNA 

<213> Homo sapien 



<400> 203 

cgaggtactg aagaacccca tcatgtgaga gatcgctcaa agtcattaac acaaagcagt 
gaaaatcatc cagcaaagca gtgctattat gagtgtgggc tatggaaaga cagcttttcp 
tacactgata aagaaaaaaa aatgaggaaa ttatttcatc cccttgtgac atctgtgact 
ttttggattt aataatcttg ctgtttttcc tctttatgac aaagaatata attgggagga 
tgaagtgtct taaaaattgt agagaccagc tcactggaat gtttttccat ccctgtattc 



. 60 
120 
180 
240 
300 
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atggcttgac tttgtgactg ctctacactg catgtctgac attgcagagt gagctatgtt 360 

gaggtaaact ggttggttgc attattttgc aatcagcctg gtctctccc 409 

<210> 204 '"" ' ' - ' •• ' 

<211> 440 : ' " ;■ k " ■ - " '■ * e s ' • - 

<212> DNA; - * ' 1 ■ " ' • .,?.:;;::.-• . 

<213> Homo 'sapien * : ; r - vj 

<220> * ' t ■ ; t -;• \ . ; .. ; , W , 

<221> mis cofeature 
<222> (1) . , . (440) 
<223> n « A,T,C or G 

<4 00> 204 

acacacatcc tgatctagct atgtttatgt gtgttggggt gatggatgga caagaggtat - «60 

agttcaaatg agatcatttt tgtgaaatgg ctttgtaaac tgtaacatgc cctataaata 120 

tgagattagc tttaatactg gccctgactc tccagtgtgg ctttgtgtgt ttgtctaaac 180 

acttagttaa tatctgtcag tggtccattg cacaaggaac tgacacaatg gtatectgtg 240 

cctctgttgt tgttgttgtt gttttttttg cagttctaaa agcttagtta attgcbttca- -300 

ttagcttaat atataccacg tgaaaagcat agaaaagcag aactcaaaac tcanagaata -360 

aaggacagaa cataactaac tactgatgtg caccttagtt acctgatgca gggaattgaa 420 
gcatataagc ttcatctacjt - O440 



<210> 205 
<211> 474 



<212> DNA ■• ~ ■ ' " l ' r -' : ' >-~ ; " ' -J-.- ! " > 

<213> Homo sapi^n rv ' ' i; ; ' : V ' ' : 



acttgtccca tgcta^tjtaa cagga-aaatfa a^a'g'tgattg ataagacata-gtccctgtcc " 60 

.^t.^aaagag-tt aacagt'ctag i baaggfcTagga : ^a6tttgagaa aagaccaafcg tgttcaaagg 120 

aaaactcaca acctgggtct cccttctcag atggcacatt caagaaactg ttgcttatgc 180 

ccctgggagc cagagcctta cttaagtctt accaagtcaa atatctatca gcctcagatg 240 

atttgagcct ggtaaagtct tagcaataga tttgctgcct catgttccca tgaaaaccta'^ -300 

ataagagaga gccctttcaa ctcaggcata cggggggttt aaggataaca tgtttagtga^ - '360 

ccatgtggac attcagcaca ggtgagcttc tcaagtgaga gccatgtgtc cccaaaagaa ' 420 

aggagggttt atccataaga ctttgctctc cctttcaaca ctgtggtggg aagt 474 

,<210> 206 . ■■>.. - 
<211> 344 ' ^ : 

<212> DNA S: - 1 -' v "* ■ " ' ■;' ' r ■. ; : 

<213> Homo sapien . ' ■ \ -> v: ■ 

<400> 206 1 - ' •-, 

accgtccttc ttggggcaga tgtctgagal: "aaactgttcc acgcdcccag c.caaaccaca- ; : . ; 60 

gcagttcaac gcatagtgga tggctttcag cgtttcccgc tggggctcat ccttggtttt 120 

cagcttgttg taggtgtcct tgtaaaactc ctggacttcc ttaatcacct catccttgtg 180 

ggaatatccc cagatggccg cagctatttc aatggcgaat atcaccaaga ggaagccgaa : ; 2'40 

gaacagtccc agcatgcact gggactcctg cacagccccg cagcagccca ggaagcccac • . 300 

cagcatcatg agggcgccgg ctccgatcag aatatagact cctg 344 

<210> 207 

<211> 441 ; 

"<212> DNA : ' : [ : : - - ,,;„.,.>.:^',: •.. •. ^ • ■ : t ■ , j0 

r <213> Homd sapien 1 • 1 ; ' - ; ' * " ry: ' r r 'i-'j ' ' ; .- ; .. • ; ,~ 



<220> 1 " i 

<221> misc" feature 



IS / 
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<222> 
<223> 



(441). 



(1). • 
n « A,T,C 



.or, G 



<400> 207 

acctcaattt 

cctcaggaga 

gccacacctg 

caaaattaca 

aacaaaaggg 

tgtctgatcc 

tgtaggacca 

tangaaactg 

<210> 208 
<21X> 365 ; ' 
<212> DNA 1 
<213> Homo 

<400> 20B : ' r 

ctggtgccag 

cctgttaaag 

ccaatggaaa 

gtaattcttg 

gaatatgtga 

ccaaaatacc 

ctagt 



ttcccccaat 
tctgagtttg 
caagttgatt 
tgaaacccag 
atttttattt 
tgttcctaag 
accgtcccag 
gagcaactgg 



ttctggctac 
aatccttgct 
agaactgcct 
tggtaaaaaa 
cacagctaaa 
ttccaaacta 
tttgcctggg 



tactaaaagc 
ctctaggatg 
ttcttcccag 
gcctctgaaa 
gggaaaatag 
tagccaacac 
actttctcat 



cagaaagaac 
caggtggctt 
gcttgacata 
gctgtaacac 
gtggagaagt 
tctgatgctg 
tt;ttacagag 



agaacagtgg 
gaagcagaat 
ggtattaagt 
cctcagtaat 
taaaaaataa 
ctctttttct 
tc'ccaaatcjc, 



sapien 



tgccagtgtc 
aagaaaaact 
catctggatg 
cagtaaatga 
aagatattta 
tactgggtcg 



tgagccagtg 
ttcgcctgag 
tgcccctgca 
tgtggatgca 
tgcttatctg 
ggaagtcact 



ccagagccgg 
cctattttgg 
gaagaagacc 
gaagatggag 
agacaacttg 
ggaaacatga 



aacctgagcc 
ttgatactgc 
tgtgtcaggc 
ctgatccaaa 
aggaagagca 
gagccatcct 



agaacctgacr 
ctctccaagc 
tttctctgat 
cctttgtagt 
agcagtcaga 
aattgactgg. 



60 
120 
180 
240 
300 
360 
420 
441 



.j '■ j' - ■ 



; 60 
120 
;180 
240 
300 
360 
365 



<210> 209 

<211> 191 

<212> DNA; 

<213> ? Homo sapien 



r. y 



t. 



<400> 209 • • ..••-<-.•••'•• 

cgaggta'ca^ ;a^^ataaagg : agaetgtfcga : attcatacca. tataaaactt gttaggtttt 
.taaacati'agc/ : .a^a''tc > aa.g 1 gct ^acaaaaacaa acctgtgittg tttttgtata : 'gattgtagg1: 
ttatttttgg atttcatata catgactgaa ctgtgtgcaa ggcaatagtt agccttgatt 
ttagcccaga g 



<210> 
<211> 
<212> 
<213> 



210 
373 
DNA 
Homo 



<400> 210 

acttaattgt 

agttcccttc 

tacagccgtg 

gcctgtgagg 

tgtttataaa 

agatataata 

ttccagaagt 



sapien 



atatttcatt 
aaaacataat 
ttgtttataa 
ttagcgaagt 
atagctcat-t 
atagtgacga 
acc 



taaatagtcc 
aaaatattta 
gagtgjtaact 
gafeccttagc 
aatgapbtag 
atcagacagg 



ttctcagggg 
oactttataa 
gaagtcctgc 
aaaatgtaaa 
gattgaatcg 
aaagatcctg 



tttaataatt 
aatattaacc 
aaatcatgct 
tgaagatctt 
ctccaaccat 
gctaaaccat 



tagaatcaat 
cgattaacaa 
gttgacacaa 
cagacagtgg 
tcgcatcatc 
ttgcattttt 



,60 
120 
180 
191 



60 
120 
180 
240 
300 
360 > 
373 i 



<210> 
<211> 
<212> 
<213> 



211 
336 
DNA 
Homo 



sapien 



<400> 211 



WO 01/96390 
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a <=tgtaatct ttcttcaf- 

2S 525 =5 SS S5P w 

rr •»••« ss s== sss =55 



60 
120 
180 
240 
300 



<210> 212 

.issss c?-£r «x 1 < - 

a-^s s""'" - ■ 

2sj f mm - in 

"<210> 213- ; ' ■ ' ^ ' - " t}r>-OP-TM V ;, , v . «_.;: - ' Q f ; <; O ": • 434 
~ <212> DNA V • ' :i ' " 

<400> 213 "■'*"■ 



v? J" h^sojs.-: »::.-•:• ■: ? r> z rv. j \> ... : -v. > o ■:> :•?..■: ? 

P j? C j ;» % v • v: £ , : . ■ : -; ,./ z ; ) ^ j 




<210> 214 ; 
<211> 353 
<212> DNA 

<213> Homo sapiens 
<400> 214 



tgtggaagca tatfff f ga aa 9gcagctt tca.t, 

§S SH ^ ^ S3S £ :: 

ttCgCC Ctata ^ gag tc^Sf f^-gct cgagcaSS ^g^^ ' ( 

<210V215 - ' ' ^ att ? act 9Mc cgtcgttlta ^1^^° 300 ^ 

<212> Dm ' . - . 

<213> Hoi «o sapiens 
<220> 
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<221> misc__f eature 
<222> (1) . . . (699) 
<223> n=A,T,C or G 



<400> 215 

acacttgaaa ccaaatttct aaaacttgtt tttcttaaaa aatagttgtt gtaacattaa 
accataacct' aatcagtgtg ttcactatgc ttccacacta gccagtcttc tcacacttct 
tctggtttca agtctcaagV cctgacagac agaagggctt ggagattttt tttctttaca 
attcagtctt cagcaa~cttg J agagctttct tcatgttgtc /aagcaacaga gctgtatctg 
caggttcgta agcatagaga cgatttfcfaat : atcttccagt gatatcggct 'ctaactgtca 
gagatgggtc aacaaacata atcct^ggga catactggcc atfeaggagaa aggtgtttgt 
cagttgtttc ataaaccaga ttgaggagga' caaactgctc' tgccaatttb tggatttctt 420 
tattttcagc aaacadtttc tttaaagctt gactgtgtgg gdactcatcc aagtgatgaa 480 
taatcatcaa gggtttgttg cttgtcttgg- atttatatag agcttcttca tatgtbtgag 54 0 
tccagatgag ttggtcaccc caacctctgg J ^ agaggg'tctg ' gggcagttt'g ggtcgaga'gt 600 
cctttgtgtc ctttttggct ccaggtttga 1 ctgtggtatc tctggccaga' igtgtaggaga 660 
nggccacaag gagcaagaat gctgacactg gaattttct 699 



60 
120 
180. 
240; 
30b' 
360 



<210> 
<211> 
<212> 
<213> 



216 
691 
DNA 
Homo 



-sapiens 



<220> 

<221> misc_f eatiire 
<222> (1) . . . (691) 
<223> n^A^/C or G 



<400> 216 '' : ■ : \ 7" ■ '\ ( . ... ■ 

ncgaggtaca ggtttcacta ttacaaatat atgatgttaa ; actaacaa&c tcatgacctt . 60 

c acacacacati ttbtaattta tdtccatttq r ctattt"ccct 1 



caaagatgtc ttcgtcccac gcacacacat ttgtaatttg tgtccatttg 
tcttctataa tcttcaaatt atatagttat gcattgagtt ■ ccctatgcat ctcacccatc 
tcctttatct cagccttctc atactttgcc attctcttct ttctggaaat aaccagcaca 

tatcaccaca accacaataa cagcaataac accagctttt 



20 
180 
240 
300 
360 
420 
480 



acaattccag caacaactgc 

agaccctgca ttgagaattc aggtgctttt tcatcaacat aataaattaa agtttgacca 
ggatccagat ccagttgttc cccatttact gtcaggtcca ttttcttaga atgaaacaag 
gattcacctt taacatcttt ttcaaaataa taagccacat cagctatgtc cacatcattc 
tgagtttttt gagaagaatt ttgaaccaga tcaatagtga taacattatt ctcatacaaa 540 
atactcgtga taaattttgg atccagttga taacgcgttg tgatctcctt ctgaagtgca 6O0 
gtccgcaaac ttttactatc ataagggttt tctcttgctt tgnggtttag ttcaatggat 660 
gatccagtag ggtctcactc gctcagagca a ' 691 



<210> 
<211> 
<212> 
<213> 



217 
497 
DNA 
Homo 



sapiens 



<400>; 217 ] J \ '• ' V " . . 

ctgtgctcct ^gat$jgtt£t '] acckcaagtc caattgctat ggttacttca gga^gctgag 60 
gaactggtct gatgccgagc tcgagtgtca gtcttacgga . aacggagccc acctggcat;c 120 
tatcctgagt ttaaaggaag ccagciaccat agcagagtac ataagtggct atcagagaag 180 
ccagcc.gata tggattggcc tgcacgaccc acagaagagg ~ cagcagtggc agtggattga 24 Q 
tggggccatg: tatctgtaca gatcctggtc "tggcaag.tcc atgggtg 

tgctgagatg agctccaata acaacttttt aacttggagc agcaacgaat gcaacaagcg 360 
ccaacacttc ctgtgcaagt accgaccata gagcaagaat caagattctg ctaactcctg 420 
cacagccccg tcctcttcct ttctgctagc ctggctaaat ctgctcatta tttcagaggg 480 
gaaacctagc aaactaa 4 97 
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<210> 218 

<211> 603 : - . 

<212> DNA ' 

<213> Homo sapiens c ' 1 

<400> 218 ...... 

acaaaggcga aagagtgg^t ggcaaccgtc aaattgtagg atatgpaa.ta ggaactcaac 60 
aagctacccc agggcccgca tacagtggtzc gagagataat ataccccaaf gcatccctgc 120 
tgatccagaa cgtcacccag aa.tgacacag gattctacac p ctacacg£i: r ataaagtcag 180 
atcttgtgaa tgaagaagca actggccag.t. tccggg.tata cccggagct^ cccaagccct 24b 
ccatqtccag ..oaacjaactcc aaacccgtgg aggacaWga, ''tgctgtggcc .t.tcacctgtg': 300 
aacctgagac tcaggacgca acctacctgt agtgggtaaa. caatcagagc:' ctcecggtca 360 
gtcccaggct gcagctgtcc aatggcaaca^ ggaccctcac; tctattcaat gtcacaagaa 420 
atgacacagc aagctacaaa tgtgaaacpc agaacccagt, gagtgccagg. cgcagtgatt 4 80 
cagtcatcct: ,gaatgt;cctc ; tatgqqgcgg^ at^ccc^a, catttcccct ctaaacacat 54 Q 
cttacagatc .a.ggggaaaat ctgaacctct: ci^^ccacgb agcctctaac ccacctgcac 60b 



603. 



<210> 219 
<211> 409 
<212> DNA 
<213> Homo sapiens 



60 



<400> 219 

ctgagagacc aggagaagtt ccagatgcag agactgtgat gctcttgact atggaattat ^ - , 
tgcggccagt agccaagtta gagacaaaac aggcgtaggt cccgttatta tttggcgtga 12 0 : "' 
ttttggcgat aaagagaact tgtgtgtgtt gctgcggtat cccattgata cgccaagaat 180 ^ 
actgcgggga tgggttagag gccgagtggc aggagaggtt gaggttcgct cccgaaaggt 240 



300 



aagacgagtc tgggggggaa atgatggggg tgtccggccc atagaggaca tccagggtga 
ctgggtcapt gcggXttgca ctcactgagt tptggattcc acatacatag. gctcttgcgt 360 
""^f^f ^acatt.gaat ; agagtgaggg , tbctgttgcc attggacag ~ 409 



■4? 



<210> 220" 
<211> 635 



t.-.-v;.:,- r. ':>o ■ j:;^^ : 



<212> DNA'' ' r " : "= ' ^ J ^' ; " r ' )0£; -:-.v..>;>:^::.; ->j .^y: 6 xr.* i: o £ )>. X^ 

<213> Homo sapiens . ' * ; ' J ' *' :r ' ' •' • '-U*' ' <■ - :* vr^-j.: ; ot,, >: v. 

■• ' * • •: ' ' •" ; ' ; • "> • • • '.JP-. • -• /: .-;r. : ; : - , ?. v.* 

<221> misc__ feature "* '' ~ L ' J 

<222i (I)... (635*) vJ; ^ - 1 ; - • 

<223> n=A,T,C or G ' : r ' - ; "' ■■ " 1 J " 

<400> 220 

acagtgatag ctccccctgg gcaatacaat acaagaacag tgggttttgt caaattggaa 60 
caaggaaaca gaaccacaga aataaataca ttggttaaca tcagattagt tcaggttact 120 
tttttgtaaa agttaaagta gaggggactt ctgtattatg ctaactcaag tagactggaa 180 
tctcctgtgt tctttttttt ttaaattggt tttaattttt tttaattgga tdtatcttct 240 " ' 
tccttaacat ttcagttgga gtatgtagca tttagcacca ctggctcaat gcgctcacct 300 . 
aggtgagagn gngaccaaat cttaaagcat tagngctatl; atcagttacc agcatttggg ifed ' 
gcttttatcc ttcatgggtt atgatgttct cctgdtgaca dajtttqtntg : 'agttttgtaa'^V ■* 
ttcc^gccaa agagagacca^tcactattt gatgg^tggc^tgcafcgcanr catttaaagc 480 
ttttanagaa. tacactpcac; cagggagtat ^actactagt latgSctatta ggagggtaat 54 0 ' 
acca^gagtt ggact^ggca. ccttaggcaa gat^caaacc... anct'aaaata gaataaagaa 6t)0 V ' 
tgagtMgat fagtgtagsc r atitttaa'ccia *j<M£ ; . , :",...'*.' .' : \ '.' ' ■' ' '63$- - ' 

<210^ ^21 *' : -' ' r -.i • 

<211>/484 ' ^■ 'x' ^ .'*. * .;. = - :v ; „ ^ . • : .. ■ ^ : r v ...^ •.; 

<212^ "DNA ' ^ < 
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<213:> Homo sapiens 



<400> 221 -n : *• £ 

actccctgtt ttgagaaact ttcttgaaga acaccatagc atgctggttg tagttggtgc 60 

tcaccactcg gacgaggtaa ctcgttaatc cagggtaact cttaatgttg cccagcgtga 120 

actcgccggg ctggcaacct ggaacaaaag tcctgatcca gtagtcacac ttctttttcc 160 

taaacaggac ggaggtgaca ttgtagctct tgtcttcttt cagctcatag atggtggcat 24Q 

acatcttttg cgggtctttg tcttctctga gaattgcatt ccctgccagg cctaccacat 300 

accacttccc ctggaattgg ttgtcctgga agttctgctg cagagggacc ttgctcagag 

gtggggctgg gatcaggtct gaggtggagt cctgggcctg ggcatgcaga gcccccaaca 

gggctaggcc cagccacagg agacctaggg gcatgatttc agggccgagg aagcaggcgc 
tgtg 



360 
420 
480 
464 



<210> 222 
<211> 566 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> (1) . - - (566) 
<223> n=A,T,C or G 



<400> 222 rt ' 

acattaaagt gtgatacttg gttttgaaaa cattcnaaca gtctctgtgg aaatctgaga 60 
gaaattggcg gagagctgcc gtggtgcatt cctcctgtag tgcttcaagc taatgcttca 120 
tcctctctaa taacttttga tagacagggg ctagtcgcac agacctctgg gaagccctgg 180 
aaaacgctga tgcttgtttg aagatctcaa gcgcagagtc tgcaagttca tcccctcttt 240 
cctgaggtct gttggctgga ggctgcagaa cattggtgat gacatggacc acgccatttg 300 
tggccatgat gtcaggctcg gcaacaggct ccttgttgac actcaccaca ttgtttttca- ?60, > 
agctgacttc cagcttgtca ccttggagag actttagccg- cacca^ggcc ccgatgcctc: ^oar; 
cgctaaccag gatttcatca ccaatgtggt, a^ttpa- S ga£ jS g^gc;aag.t ; .tcc£tggca.fc -4.80 • : 
ctcccaagag tctgctccgt tctcttggtg gcagggctcg gaaggcttca tttgtgggag 540 
caaagactgt gtagacttcc tttccc - 



<210> 
<211> 
<212> 
<213> 



223 
478 
DNA 
Homo 



sapiens 



<400> 223 " \, - . , . v - - ' ' 

caggtactta tttcaacaat tcttagagat gctagctagt gttgaagcta aaaatagctt 

tatttatgct gaattgtgat ttttfctatgc caaatttttt ttagttctaa tcattgatga 

tagcttggaaataaataatt atgccatggc ; atttgacagt tcattattcc tataagaatt 

aaattgagtt tagagagaat ggtggtgttg agctgattat taacagttac tgaaatcaaa 

tatttatttg .ttacattatt ccatttgtat tttaggfcttc. cttttacatt ctttttatafc 

gcattctgac attacatatt ttttaagact atggaaataa . tttaaagatt taagctctgg 

tggatgatta tctgctaagt aagtctgaaa atgtaatatt ttgataatac tgtaatatac 

ctgtcacaca aatgcttttc taatgtttta accttgagta ttgcagttgc tgctttgt 



60 

120- 

180 

240 ; 

3Q0 

36Q 

420 

478 



<210> 
<211> 
<212> 
<213> 



224 
323 
DNA 
Homo 



sapiens 



acgggcaccg gcttccccta. cagatggtca cccacctgca.agtgga.tggg: gatctgcaac 60 
ttcaatcaat caacttcatc ggaggccagc ccctccggcc ccagggaccc ccgatgatgc 120 
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caccttgccc taccatggaa ggacccccaa ccttcaaccc gcctgtgcca tatttcggga 180 

ggctgcaagg agggctcaca gctcgaagaa ccatcatcat caagggctat gt'gcctccca 240 

caggcaagag ctttgctatc aacttcaagg tgggctcctc aggggacata gctctgcaca 300 

ttaatccccg catgggcaac ggt 323 

<210> 225 ' ^ • :i 

<211> 147 ■ ■ — j ; . .'. ' L '1 J ' 

<212> DNA ' ' - -■ "> " i : ; s r . 

<213> Homo sapiens r • ■-•*>--■■ ; ~ ' • *■':. 

<22d> ■ • ■ - • - r • ■ ' - : - -. -r ■• .- ■ 

<22i> L misc_feature ■'■ " ' ; -'-V" - - ■'■ * 

<222> {!)... (147) 
<223> n=A,T,C or G 

<400> 225 

ttggacttct agactcacct gttctcactc cctgnttnaa ttnaacccag ncatgcaatg 60^ 

ccaaataata naattgctcc ctaccagctg aacagggagg* agtctgtgca gttndtgaca 120* 
cttgttgttg aacatggtta aatacaa 147 

<210> 226 '? ■ -'"•..„ : 

<211> 104 ' ! ) • j ■ £ 

<212> DNA - " 

<213> Homo sapiens 

<220> ' ; - '* : ■■ J • > : . cc/:: ' H*: -. ■ r >''. . . . ^^ j ; - : r ■ .-. 

<221> raisc_£ eature 
<222> (1) . . (104)1 : j 
<223> n=A,-T,C or 



<400> 226 - 

nncagghaca -t gtgt^aaaa ^ caa t a t tgta -tactaccata gtgagccatg antntritaaa 60 
aaaaaaatea ^tgttt"tggg :, %f^ " * -"="■» x 5 4 

<210> 227 ^ji^'-:-?^;!-.: : >■ \ .•- ; 

<211> 491 

<212> DNA 

<213> Homo sapiens 

<400> 227 

acactgttgg tgttatatgg ggatggggtt ctcggtaatt ttgtttatta tttatgttta 60 
ttattatgtt ttatcattaa ttattcaata aatttttatt taaaaagtcg ccctacttag 120 
aaatcttctg tgggggtggg agggacaaaa' gattacaaac -caaaactcag gagatggtaa 180 
cactggaatt gataaaatca cctgggatta gtcgtataac tctgaaccac caaacctctg 240 
ctatcaagcc ttgctacagt 'catggctgtc cagaaagatt tacagttatt tttctgagaa 300 
aggatccatg ggctttaaga acttcagaac ' tttaagaact tcagaagttc ttaagttgct 360. 
gaagctcaag taacgaagtt gaatgcaatc aaaaaaagaa taccagggag" tcaaggcttg 420 
agag^cacat tcttatccta aagtgactgc tcaaacctga cgagaccaag • taaattactg 480 
aagatacaaa g f :i - : \V. . ■ ;:x : .: 

<210> 228 
<211> 328 
<212> DNA 
<213> Homo sapiens 

<400> 228 

actcagcgcc agcatcgccc cacttgattt tggagggatc tcgctcctgg aagatggtga 1 60 
tgggatttec attgatgaca agcttcccgt -tctcagc'ctt 'gacggtgcca'tggaatttgc' 120 
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catgggtgga atcatattgg aacatgtaaa ccatgtagtt gaggtcaatg aaggggtcat 180,. 

tgatggcaac aatatccact ttaccagagt taaaagcagc cctggtgacc aggcgcccaa 240 

tacgaccaaa tccgttgact ccgaccttca ccttccccat ggtgtctgag cgatgtggct 300 

cggctggcga cgcaaaagaa gatgcggc 328 

<210> 229 
<211> 689 
<212> DNA 

<213> Homo sapiens 

<220> •• •' ' 

<221> misc_f eature 
<222> (1) . . . (689) 

<223> n=A, T, C or G >;•. . \ , ; v • ; -• ■ :j , . ■ 

■ ■ v , .' > ■ . _ o \-\ .•• . ■ ' , •• - ' ■' ■ • ' - 

<400> 229 • - . - ' . J :. ; v. i < -• .. • ••• ■ ■ - > • ^ : 

accacagcat :catcccttgg tccagaatct ' ac.taecttcc raeagcggccc r aggctccact 60 ; ■ - 
gaaaeaacac tcctacctga eaacaccaca. gcctcaggccz.tqcttgaagp ; afcctacgcGC 120 ,- r 
gtccacagca ;gcactggatc gccacacaca acactgtccc t ctgccggntc : taeaacccgt ,190 , : 
cagggagaat : ctaccacctt ccagagctgg? ccaaactcga;, aggacactac ccctgcacct 240 j 
cctactacca ; catcagcctt. tgttgagctar tctacaacet^ cccacggcag. Qccgagctca ; 300 
actccaacaa .cccaettttc :tgccagctcci acaaccttgg, gcogtagtga, ggaatcgaca 360 
acagte'caca -gcagcceagt,: tgcaac.tg.ca acaacaccct .cgcctgccpa ; ctcca.o.aacc ; , 420 
tcaggcctcg ttgaagaatc tacgacctac. cacagcagcci cgggctcaac tcaaacaatg 480 ; 
cacttccctg aaagcgacac aacttcaggc cgtggtgaag aatcaacaac ttcccacagc 540 
agcacaacac acacaatatc ttcagctcct agcaccacat ctgcccttgt tgaagaacct 600. 
accagctacc acagcagccc gggctcaact gcaacaacac acttcccttg acaggttcca 660 
caacctcaag gccgtagtgg agggaaatc 689 

<210> 230 

<211> 483 : " 

<212> DNA -' - : ■■' .= ' " :rz ;t. j.; 'j z:* ■■- :\. -.y- \ r ^> -w = ■ £ 

<213> Homo sapiens >• : ' : " : ^ ' j j ?*v>-vjz> o^oao-vc-.^ .;;.,;;.■>.;, _■ • - 1 j 

<4 00> ^230 ~ * c- V; i ^/* ; ^ " ' - *■ ■.- : ^ • 

gggttctagc tcctccaatc ccattttatc ccatggaacc;; actaaaaaca aggtctgctc 60 

tgctcctgaa gccctatatg ctggagatgg acaactcaat gaaaatttaa agggaaaacc 120 

ctcaggcctg aggtgtgtgc cactcagaga cttcacctaa ctagagacag gcaaactgca 180 . ^ 

aaccatggtg agaaattgac gacttcacac tatggacagc ttttcccaag atgtcaaaac 240 

aagactcctc atcatgataa ggctcttacc cccttttaat ttgtccttgc ttatgcctgc 300 

ctctttcgct tggcaggatg atgctgtcat tagtatttca caagaagtag cttcagaggg 360 

taacttaaca gagtgtcaga tctatcttgt caatcccaac gttttacata aaataagaga ; 420 

tcctttagtg cacccagtga ctgacattag cagcatcttt aacacagccg tgtgttcaaa 480 
tgt 483 

<210> 231 
<211> 447 

<212> DNA ■■■ f - ■ ■ ' ' : ■; . : ] 

<213> 'Homo sapiens . ' :• ;*r; ■ .' ir -'^ " ' ■ ' 

accctctcta ttcactagct - tctgaaaagg gaggagtatt tttagtttga caatttaata 60 ;:; 
atttaaaaac aagacatctc caggtaggaa aaaatgaaag ctatttcatg caaacattat 120 
ctaatttagcrttaaaagtga aagtggtaat.) actgttggtt - tctgtaaatg: ttgcagggtt 180 
ttaaacttta taattacttt ; aatatttttg ataactagaa, atctagtatt gccataaagg;. 240 , . ^ 
aaactaagtg cccatcaaag: atttgtttgg tataaataaa;. gaattatttg ttttgttttc 300 
aatgacagta agctacaaat - catgatgctt aaaaactttc. taaagatgaa ttgtgtggca: 360 
gtgattggtc tgtttgtgga gaatg.tatga aagctattaa. tattctagaa tagattaata 420 = = 
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aattggetat gttgt-tccaa tgaatgt 



447 



<210> 232 
<211> 649 
<212> DNA 
<213> Homo 



sapiens 



L J. - 



<220> 

<221> misc^feature • : • 

<222> (1) . . . (649) 
<223> n=A,T,C or G 

<400> 232 

gtgggcagaa gaaaaagcta gtgatcaaca gtggcaatgg agctgtggag gacagaaagc 60 ; 
caagtggact caacggagag gccagcaagt ctcaggaaat ggtgcatttg gtgaacaagg 120 
agtcgtcaga aactccagac cagtttatga cagctgatga gacaaggaac ctgcagaatg 180 
tggacatgaa gattggggtg-- taacaccta-c-' accafctatct tggaaagaaa rcaaccgttgg. 240 
aaacataae'c- attacaggga gptgggacrac ; tfcaacaga.tg.Gaatgtgcta./ ctgattgttt 300 
cattgegaat tttttttagc ataaaatfctt- > cit&ttctttt , tgttttttgt .: gttttgttct .360 
ttaaagtcag - gtccaatttg taaaaaeagc'J attgetttctJ gaaattaggg \ cccaattaafc -42.0 
aatca^caa'g- aatttgatcg' - -tt'C'cagttcc: Gacttggagg ; cottbcatcc ctegggtgtg 
ctat^cjatggi cttctaacaa : aaaetacax?a : -tatgtattcc; tgatcgccaa: ccStttccccc 
.'.accagdtaag gacatttccc agggttaata ggge<2liggt ; Gn cotgggagga: aatttgaatg ; 600 
ggtcibatttt gcecttncat ; agcefcaatec ctg.ggcattg^ct&tncact . : ; . .. •>.; j . i ,.:64 9 



480 
5>40 



<210^ 
<211> 
<212> 
<213> 



233 
396 
DNA 
Homo 



sapiens 



<400> 233 

acaatgcaaa acataagtaa tcttttcact attataacac ttgtatgatt ttaagacaaa .60 

cttggcttaa attaagtttt ggggtcagcc ccaaattcct gccccttcac tgttattttga 120 

attattttta aactctcaga tacagcttta tagttaaaac attattagac tatatattct 180 

aaattctaaa gtgaccaaag gggacagttt atgtaaagat aacacttttt cttaattttt tZ4Q 

agaaaaccat tctttcatct cctg^tggtc ttctttttcc gtctctattt* ct.tttgttag .300 
catcctattt ggtagtttgt taatatacat cttccctgag : t.gtttttaca aeacaaagcc 360 
atttagtgat ; tcfcgaatggc tactctgcct gccagt. z> ■ ' . 396 



<210> 
<211> 
<212> 
<213> 



234 ■• 
4627 
DNA 
Homo 



sapiens 



<400> 234 

tcacttgcct gatatttcca gt'gtcagagg gacacagcca acgtggggtc ccttctaggc 60 , 
tgacagccgc tctccagcca ctgccgcgag cccgtctgct cccgccctgc cegtgcactc ,120 
tccgcagccg ccctccgcca agccccagcg cccgctccca tcgccgatga ccgcggggag 180 
gaggatggag atgctctgtg ccggcagggt ccctgcgctg ctgctctgcc tgggtttcca, 240 
tcttctacag. gcagtcctca gtacaact£.t. ga^ttcqatca^t<»tatcccag. gagagtccag. 3Q0 
tgataactgcr acagctttag ttcagacaga > agacaa-tccao^gtgtggctc aagfegjtgaat . ; 36,0. 
aacaaagtgt agctctgaca tgaatggcta;. ttgfcttgcart. ^.gacagtgca • tctatctggt -.420 
ggacatgagt*: caaaactact. gcaggtgtga;agtgg.gtt<cLt, actggtgtcc gatgtgaaca .480. 
cttcttttta accgtccacc aacctttaag^ caaagagtat gtggctttga cegfcgattct ; 540 
tattattttg tttcttatca • cagtcgtcgg.- ttccacatat tatttctgca. gatggtacag 600 
aaatcigaaaa agtaaagaac caaagaagga ' atatgagaga gfctacctcag; gggatccaga 660 
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gttgccgcaa gtctgaatgg cgccatcaaa; cttatgggca;,;gggataacag/tgtgc"ctggt 720 
taatattaat . attccatttt attaataata tttatgttgg gtcaagtgtt aggtcaataa 78 0 
cactgtattt taatgtactt gaaaaatgtt tttattjtttg fctttattttt gacagactat 84 0 " 
ttgctaatgt ataatg'tgca gaaaatattt" aatatcaaaa _ ^aaaat%gat_ aje'tttt.atac 900 
aagtaatttc ctgagctaaa tgcttcattg aaagcttcaa agtttatatg c'ctggtgcac 960 ' 
agtgcttaga , agtaagcaat tcccaggtca tagctcaaga attgttagca aatgacagat 1Q?0; 
ttctgtaagc ctatatatat agtcaaatcg atttagjtaag tatgtttttt atgttcctca 1080 
aatcagtgat aattggtttg actgtaccat ggtttgatat gtagttggca ccatggtatc 1140 
atatattaaa acaataatgc aattagaatt tgggagaagc aaatataggt cctgtgttaa 120CF "" 
acactacaca tttgaaacaa gctaaccctg gggagtctat ggtctcttca ctcaggtctc 1260. 
agctataatt ctgttatatg aggggcagtg gacagttccc tatgccaact cacgactcct 1320 t 
acaggtacta gtcactcatc taccagattc tgcctatgta aaatgaattg aaaaacaatt 13.8 01 
ttctgtaatc ttttatttaa gtagtgggca tttcatagct tcacaatgtt ccttttttgt 14 40 r . 
atattacaac atttatgtga ggtaattatt gctcaacaga caattagaaa aaagtccaca 1500 
cttgaagcct aaatttgtgc tttttaagaa tatttttaga ctatttcttt ttataggggc 1560 . 
tttgctgaat tctaacatjta aatcacagcq. c ; aaaatt,tga tggactaatt attattttaa 1620 f . 
aatatatgaa gacaataatt ctacatgttg tcttaagatg ga'aatacagt tatttcatct 1680" 
tttattcaag gaagttttaa ctttaataca gctcagtaaa tggcttcttc tagaatgtaa 1740 
agttatgtat, ttaaagttgt atcttgacac ; aggaaatggg L aaaaaac;tta aaaat^aata 1800 t 
tggtgtattt' ttccaaatga aaaatctcaa ttgaaacfctt ttaaaatgta gaaacttaaa 1860 
cacaccttcc tgtggaggct gagatgaaaa ctagggctca ttttcctgac atttgtttat 1920 
tttttggaag. agacaaagat ttcttctgca ctctgagccc ataggtctca gagagttaat 1980 
aggagtattt ' ttgggctatt gcataaggag ccactgctgx; caccactttt ggattttatg 2040 
ggaggctcct tcatcgaatg ctaaaccttt gagtagagtc tccctggatc acataccagg 2100 
tcagggagga tctgttcttc ctctacgttt . atcctggcat gtgptagggt aaacgaaggc 2160 
ataataagcc atggctgacc tctggagcac caggtgccag gacttgtctc catgtgtatc 2220 
catgcattat ataccctggt gcaatcacac gactgtcatc taaagtcctg gccctggccc 22 80 
ttactattag gaaaataaac agacaaaaac aaqtaaatat atatggtcct atacatattg 2340 
tatatatatt catatacaaa catgtatgta tacatgacct taatggatca tagaattgca 2400 ; 
gtcatttggt gctctgctaa ccatttatat aaaacttaaa aacaagagaa aagaaaaatc 2460' 
aattagatct aaacagttat ttctgtttcc,, tatiftaata.t^ agctgaag[tc aaaatatgta 25,2 0, r ,, 
agaacacatt" ttaaatactc tacttacagt " tggccctctg tggttagttp ca cat ctgtg 2580 
gattcaacca accaaggacg gaaaatgctt aaaaaataat acaacaacaa caaaaaatac 2640 
attataacaa^ptaft.tact,! t^tttttttt cttttlpgaga tggagtct;cg ctctgttgcc 2700 , 
caggttggag" tgcagtggca cgatctcggc tcactgcaac ctcacctccc gggttcaaga 2760 
gatcctcctg cctcagcctc ctgagcagct gggactacag gcgcatgcca ccatgcccag 2820 
ctaatttttg tatttttagt agaggcgggg tttcaccatg ttggccagga tggtctcaat 2880 .. 
ctcctaacct tgagatccac cctccacagc ctcccaaact . gctgggatta caggcgtgag 2940 
ccaccgcacg tagcatttac attaggtatt acaagtaatg taaagatgat ttaagtatac 3000 
aggaggatgt gaataggtta tatgcaagca ctatgccctt ttatataagt gacttgaaca 3060 
tctgtgcccg attttagtat gtgcaggggg gcgatctggg aatcagtccc ctgtggatac 3120 
caaggtacaa ctgtatttat taacgcttac tagatgtgag gagagtctga atattttcag 3180 
tgatcttggc tgtttcaaaa aaatctattg acttttcaat aaatcagctg caatccattt 324 0 
atttcattta ; caaaagattt attgtaagcc tctcaatctt ggtttttcag ttgatcttaa 3300, ; 
gcatgtcaat' tcataaaaac aagtcatttt tgtatttttc atctttaaga atgcttaaaa 3360 
aagctaatcc ctaaaatagt tagatctttg taaatgcata ttaaataata aagtatgacc 3420 : 
cacattactt tttatgggtg aaaataagac aaaaataata gttttagtga ggatggtgct 3480 
gagtaaacat aaaaactgat ttgctctcag ctgatgtgtc ctgtacacag tgggaagatt 3540 
ttagttcaca cttagtctaa ctcccccatt ttacagattt ctcactatat atatttctag 3600 
aaggggctat gcatattcaa tgtattgaga accaaagcaa ccacaaatgc ataaatgcat 3660 ( 
aatttatggt cttcaaccaa ggccacataa taacccagtt aacttactct ttaaccagga 372.0 
atattaagtt ctataactag tactcaaggt ttaaccttaa aattaagatt tccttaacct 3780 ( 
taaccttaaa attgatatta tattaaacat acataataca atgtaactcc actgttctcc 3840 ,^ 
tgaatatttt ttgctctaat ctctctgccg aaagtcaaag tgatgggaga attggtatac 3900 
tggtatgact acgtcttaag tcagattttt atttatgagt ctttgagact aaattcaatc 3960 
accaccaggt .atcaaatcaa.cttttatgca gcaaatatat ; gattctagtg tctgactttt 4020. ) ; 
gttaaattca gtaatgcagt ttttaaaaac ctgtatctga: : cccactttgt aatttttgct 4080 . 
ccaatatcca ttctgtagac ttttgaaaaa aaagttttta atttgatgcc caatatattc 4140 
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tgaccgttaa aaaattcttg ttcatatggg agaaggggga gtaatgactt gtacaaacag 
tatttctggt "gtafcatttta atgtttttaa aaagagtaat ttcatttaaa tatctgttat 
tcaaa'tttga ;t.gatgttaaa ; tg^aatataa tgtat^ttct' ttttattttg ' cactctgtaa 
ttgcactttt Itdag^ti^tgaa gagp^atttt ggtaaacggt ttttattaaa gatgctatgg 
aacataaagt tgtattgcat gcaatttaaa' gtaacttatt tgactatgaa tattatcgga 
ttactgaatfc igtatcaattt^gtttcftgttc aatatcagct ttgataati^ t'gtaccttaa 
gata.t*):ga,ag gagaaaatag ataattjcaca agatattatt aatttttatt* tatttttctt 
gggajattgaa aaaaattgaa/ataa'ataaaa atgcattgaa catcttgcat tcaaaatctt 
cactgac " ; . ' " • - - " •■• ■--•H.:- ■". i 

<210> 235 ■<- : - y ~'- - ' :~--\ : ••'-■■Jr- - • 

<2ii> 169 : '" "; ■ *.■? ^';:v^' £■ 

<212> PRT ; r ; P' 
<213> Homo sapiens 

<40o> 235' j{' ■ =- -• -• — ■■ ■« : • • ^ ^ 

Met Tfrr Ala jGl^ Arg Arg Met Glu Met Leu" Cys 'Ala Gly Arg- Val Pro ' * 
Ala Leu ieu leu'Cys Leu Gly; iphV Hi s ieu^ Leu ' Gin Ala Val Leu Ser : 



4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4627 



V 



Thr. Thx VaT Jle Pro. Ser Cys lie Pro Gly Glu' ^er Ser As^ Alri 'Cys -" 

". ; - ; 35' J :: - ,o 40 '. :7.- ; : ^ f t : 1 7 J 

Thr Ala Leii Val^Gln' l»hf Glu : ^Asp Asn Pro' Arg Val Ala Gin Ser 



60 



lie Thr 
65 



?hr Lys : Cys Sk Ser Asp Met A'sn Gly Tyr Cys Leu His Gl f Gin 

■"■ \, . " 3 ^ -70^ ' ' : " : . - "■' ■ 75' " ^ = ; ; g() 

Cys" Jft "tyr 'lieu ' Vkl'^ : ile : t ; ! Ser ; Glirr ik' Tji" Cy^ Arg Cys Glu Val 



Gly, Tyr Thr Gly Val Arg cys r ' Glu : tii& Pne Phe Leu 7 

100 : ^ - ' 1 rbs" ^ " 



~j >. /: 



Vai s His £ Gln S£0 

lib : ' ,; - 



t » ■! > 



Pro Leu Ser Lys Glu Tyr Val Ala Leu Thr Val ; lie Leu lie lie Leu 
115 1 120 ; ° 125 ' 

;v " v " * j 

Phe Leu He Th* : Val Val Gly Ser Thr Tyr 1 Tyr Phe Cys Arg Trp Tyr 
130— 135 : 140 

Arg; Ash Arg Lys Ser Lys Glu Pro* Lys Lys Glu Tyr Glu Arg L Val Tnr 
145 . ; 15b 155 : * - 160 



Ser Gly "Asp Pro £lu Leu Pro Gin Val 
' : - ' " 165" 1 



> .', t. 



~.'') 

Ji" 



• j. 



' : '.V O ':' 



<210> 236" 
<211>^ B94 
<212> DNA 
< 2 1 3> ; Homo* s api J en s 

<4 00>. 236 f : 

atgcatcacc atca'ccatca^ fcacggccgcg - tccgat r f 



cagggattcg; bcattdcgat cgggcaggcg- atggegatcg cg^gccagat ; caagct€ 



ccc 



60 
120 
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accgttcata tcgggcctac cgccttcctc ggcttgggtg ttgtcgacaa caacggcaac 180 
ggcgcacgag tccaacgcgt ggtcgggagc gctccggcgg caagtctcgg c ftctccacc 240 



ggcgacgtga tcaccgcggt cgacggcgct ccgatcaact cggccaccgc gatggcggac 

gcgcttaacg ggcatcatcc cggtgacgtc atctcggtga cctggcaaac caagtcgggc 360 

ggcacgcgta cagggaacgt gacattggcc gagggacccc cggccgaatt cgatgccttc 420 

ctgaaatatg agaaggccga caaatactac tacacaagaa aatgtcgcaa tctgctgtcc 480 

ttcctgaggg gcacctgctc attttgcagc cgcacactga gaaagcaatt ggatcacaac : 540 

ctcaccttcc acaagctggt ggcctatatg atctgcctac atacagctat tcacatcatt 600 

gcacacctgt ttaactttga ctgctatagc agaagccgac aggccacaga tggctccctt 660 

gcctccattc tctccagcct atctcatgat gagaaaaagg ggggttcttg gctaaatccc 720 

atccagtccc gaaacacgac agtggagtat gtgacattca ccagccgggg tcaaacagag - ■ 780 

gagagcatga atgagagtca tcctcgcaag tgtgcagagt cttttgagat gtgggatgat 840 

cgtgactccc actgtaggcg ccctaagttt gaagggcatc cccctgagtc ttaa 894 

<210> 237 _ 1 . " 

<211> 297 " c 

<212> PRT 

<213> Homo sapiens „ , ".' ..' 

<400> 237 

Met His His His His His His Thr Ala Ala Ser Asp Asn Phe Gin Leu 

- i " 5 io '- is "' ••- 1 • : - : r: 

Ser Gin Gly Gly Gin Gly Phe Ala He Pro He Gly Gin Ala Met Ala 

20 25 30 / ^ 

He Ala Gly Gin He Lys Leu Pro Thr Val His He Gly Pro Thr Ala 

35 40 45 

Phe Leu Gly Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val 

50 55 60 

Gin Arg Val Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr 
65 7 0 75 : BO 

Gly Asp Val He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr 

85 90 95 ; ; - 

Ala Met Ala Asp Ala Leu Asn Gly His His Pro Gly 2&p Val -life- -Ser 

100 105 HO 

Val Thr Trp Gin Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr 

115 120 125 

Leu Ala Glu Gly Pro Pro Ala Glu Phe Asp Ala Phe Leu Lys Tyr Glu 

130 135 140 ! 

Lys Ala Asp Lys Tyr Tyr Tyr Thr Arg Lys Cys Arg Asn Leu Leu Ser 
145 150 155 160 

Phe Leu Arg Gly Thr Cys Ser Phe Cys Ser Arg Thr Leu Arg Lys Gin 

165 170 175 

Leu Asp His Asn Leu Thr Phe His Lys Leu Val Ala Tyr Met He Cys 

180 185 190 ' 

Leu His Thr Ala He His He lie Ala His Leu Phe Asn Phe Asp Cys 

195 200 205 

Tyr Ser Arg Ser Arg Gin Ala Thr Asp Gly Ser Leu Ala Ser He Leu 

210 215 220 

Ser Ser Leu Ser His Asp Glu Lys Lys Gly Gly Ser Trp Leu Asn Pro 
225 230 235 240 

He Gin Ser Arg Asn Thr Thr Val Glu Tyr Val Thr Phe Thr Ser Arg ' ; 

245 250 255^' 

Gly Gin Thr Glu Glu Ser Met Asn Glu Ser His Pro Arg Lys Cys Ala 

260 265 270 - ; ' 

Glu Ser Phe Glu Met Trp Asp Asp Arg Asp Serf- His Cys Arg Arg Pro 

275 280 285 

Lys Phe Glu Gly His Pro Pro Glu Ser '' \ ,' 

290 295 ' ' 
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<210> 238 . - c , r 

<2ii> 25 I*.. ; , . 

<2;12> DNA ; ; : J } . .\, 

<213> Artificial Sequence 



<220> 

<2.23> PCR r primer 



<400> 230 * " 1 

ttttcttgtg tagtagtatt tgtcg 

<210> 239 " " 

<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 239 v 

tgtcgcaatc tgctgtcctt cc ; 
<210> 240 

<211> 22 - . .. - , 

<212> DNA *- ' * 

<213> Artificial Sequence 

<220> 

<223> PCR primer 

<400> 240 " r " r " 
gctggtgaat gtcacatact cc, . r ~ 

<210> 241 ti . ^ 

<211> 20 
<212> DNA f 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 241 I , 

cggggtcaaa cagaggagag 

<210> 242 J 
<211> 33 
<212> DNA 

<213> Artificial Sequence. \ 
<220> 

<223> PCR primer 



2a 



-> ... i :. 



I • A. 



J ' < i ■> ■ 



I.J 



0 0.;. 



22 



'."'Ti 



20 



.» ..*r j 



<400> 242 ... ' " 

gtcgaattcg atgccttcct gaaatatgag. aag r 'i 

<210> 243 '"' 1 

<211> 33 



33 



j 1 
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<212> DNA 

<213> Artificial; Sequence 



<220> ' - - • - ■ ■' \ " 

<223> PCR primer • 

<400> 243 

cacctcgagt taagactcag ggggatgccc ttc 



<210> 
<211> 
<212> 
<213> 



244 
2609 
DNA 
Homo 



sapiens 



33 



<220> 

<221> misc_feature 
<222> (1) . . - (2609) 
<223> n - A;T,C or G 



<400> 244 

gctgatagca 

ctcttggggt 1 

atgttccatt 

gaagggctcc^ 

ttttgtttct 

aatatgagaa 

gtgcccgagc 

gtcgcaatct 

agcaattgga 

cagctattca 

ccacagatgg 

gttcttggct 

gcgttgctgg 

ctactgagtt 

tcttctatat 

aggagagcat 

atcgtgactc 

ggatccttgc 

agcagaaggt 

tgaacaagcg 

tctctctcct 

ccattcatat 

aatattcacc 



ttttccagta 
ctatcttgaa 
agatctattt 
tgacttccct 
tcttcctcac 
ccactgacat 
atgagttttc 
gccctcggac 
ctagaaaggt 1 
taatctgcat 
tctgcatttt 
cactttagga 
cattcaactt 
gaanttctta 
ccatgactgt 



cagttctgtc 
aggtgtgtgt 
cctgaaggac 
aaaccacctc 
ggtt^tttgg 
ggccgacaaa 
gtctgctctc 
gctgtccttc 
tcacaacctc 
catcattgca 
ctcccttgcc 
aaatcccatc 
tctcactgga 
catccggagg 
ccttggctta 
gaatgagagt 
ccactgtagg 
accggtcatt 
tgtgattacc 
tggcttcagc 
ggaatggcat 
ccgagcagca 
aattcccagg 
tgaagtggct 
atccatctgg 
ctactggatc 
ggaacaggag 
cggatgggac 
cgtgacaggt 
tacaatagcfc 
tttggcaaag 
tcaattctac 
tttgtctctt 
gtctcttHgt 
taagaatgtg 
cgttacttga 
aagcccatgg 
agcaaggctt 



cagagaagga 
t£ttcacatc 
ctctccagaa^ 
ttgacaatgcf 
ttagggctga 
tactactaca 
tgcttgaatt 
cfcgaggggca 

accttccaca 
cacctgttta 1 
tccattctct 
cagtcccga'a 
gtgatcatga 
agttattttg 
gggattcacg 
catcctcgca 
cgccctaagt 
ctttatatct 
aaggttgtta 
atggaagtgg 
ccttttactt 
ggggactgga 
attgaagtgg; 
gtgctggttg 
tacaaattcc 
tgcagggaga 
atggaggaat 
agca'atattg 
ctgaaacaga 
acctcccacc 
agcctgegca 
ttcaacaaag- 
tgtatcttca 
atcttcagta 
cctctcaagc 
gcttcagcaa 
atcctttctc 
gatagcagaa 



aggcggaata 
ttaaaggctc. 
tccggattgc 
gaaactgggt 
at^ttttcct 
caagaaaaat 
ttaacagcac 
ccfcgctcatt 
agctggtggc 
actttgactg 
ccagcctatc 
acacgacagt 
caatagcctt 
aagtcttctg v 
gcattggtgg 
agtgtgcaga 
ttgaagggca 
gtgaaaggat 
tgcacccatc 
ggcagtatat 
tgacctctgc 
cagaaaatct 
atggtccctt 
gagcaggaat 
agtgtgcaga 
caggtgcctt 
taggcaaagt 
ttggtcatgc 
aaacctcctt 
ccaagtctgt 
aatgctgtca 
aaaatttttg 
gtaattgagt 
atttacttgg 
cttgactccc 
cttaagaact 
agaaaaataa 
gtttggtggt 



aacttattca 
kcagaccctg 

tgaatcttcc 
tjgttaa^cac 
gtttgtggat 
ccttgggtca 
gctgatcctg 
ttgcagccg6' 
ctatatgatc 
ctatagcaga 
tcatgatgag 
ggagta/t'gtg 
gattctcatg 
gtatacteac 
aattgtccgg 
gtcttttgag 
tccccctgag 
cctccggttt 
caaagttttg 
ctttgttaat 
tccagaggaa 
cataagggcfc 
tggcacaigcc 
tggggtcacc 
ccacaacctc 
ttcctggttc 
gggttttcta 
agcattaaac 
tgggagacca 
agtgggagtt 
ccgatattcc 
agttatagga 
tataggaata 
tctcntcagg 
tggtattctt 
tctgaagttc 
ctgtaaatct 
tcanaattat 



ttcccaggaa 
cgctgga'caa ,J 
ctgttgccta 
tggttttcag 
gccttcctga 
acattggcct 
cttcctgtgt 
acactgagaa 
tgcctacata 
agccgacagg 
aaaaaggggg 
•.'abattcacca 
gta*acttcag 
caccttttta 
ggtcaaacag 
atgtgggatg 
tcttggaagt 
taccgctccc 
gaattgcaga 
tgcccctcaa 
gatttcttct 
ttcgaacaac 
agtgaggatg 
ccctttgctt 
aaaacaaaaa 
aacaacctgt 
aactaccgtc 
tttgacaagg 
atgtgggaca 
ttcttatgtg 
agtctggatc 
ataaggacgg 
aggacggtaa 
tttgancagt 
tttttgattg 
ttaaagttct 
ttctggacag 
: acaactaatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84*0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
168b 
1740 

18 ob 

1860 
1920 
198 b 
2040 
2100 
2160 
2220 
2280 
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ccaggtgatt 
tccctcccac 
ataattaatg 
atccccatat 
tgtgaacagc 
aaaaaaaaaa 

: -<210> 245 
<211> 564 
<212> PRT 
<213> Homo 



ttatcaattc 
ccccacagaa 
ataaaacata 
aacaccaaca 
aatttattat 
aaaaaaaaaa 



cagtgttacc 
gattttaagt 
ataataaaca 
gtgtacatgt 
ttttgctcat 
aaaaaaaaa 



atctcctgag 
agggtgactt 
taaataataa 
ttactgtcac 
caaaaaataa 



ttttggtttg 
tttaaataaa 
acaaaattac 
ttttgatatg 
aggatttttt 



taatcttttg 
aaitttattga 
cgagaacccc 
gtttatccag 
ttcacttgaa 



sapiens 



<400> 245 
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